Neural Network Emulation of Synthetic Hyperspectral Sentinel-2-Like Imagery With Uncertainty
Hyperspectral satellite imagery provides highly-resolved spectral information for large areas and can provide vital information. However, only a few imaging spectrometer missions are currently in operation. Aiming to generate synthetic satellite-based hyperspectral imagery potentially covering any region, we explored the possibility of applying statistical learning, i.e. emulation. Based on the relationship of a Sentinel-2 (S2) scene and a hyperspectral HyPlant airborne image, this work demonstrates the possibility to emulate a hyperspectral S2-like image. We tested the role of different machine learning regression algorithms (MLRA) and varied the image-extracted training dataset size. We f…
Global Sensitivity Analysis of Leaf-Canopy-Atmosphere RTMs: Implications for Biophysical Variables Retrieval from Top-of-Atmosphere Radiance Data.
Knowledge of key variables driving the top of the atmosphere (TOA) radiance over a vegetated surface is an important step to derive biophysical variables from TOA radiance data, e.g., as observed by an optical satellite. Coupled leaf-canopy-atmosphere Radiative Transfer Models (RTMs) allow linking vegetation variables directly to the measured at-sensor TOA radiance. Global Sensitivity Analysis (GSA) of RTMs enables the computation of the total contribution of each input variable to the output variance. We determined the impacts of the leaf-canopy-atmosphere variables on TOA radiance using GSA to gain insights into retrievable variables. The leaf and canopy RTM PROSAIL was coupled with…
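The "total contribution of each input variable to the output variance" mentioned above corresponds to a total-order sensitivity index. The following is a minimal sketch of how such an index can be estimated by Monte Carlo sampling, using Jansen's estimator on a hypothetical toy function. The names `total_sobol_indices` and `toy_model` are illustrative assumptions; the toy function stands in for a coupled RTM and is not PROSAIL or any model from the paper.

```python
import random


def total_sobol_indices(model, n_inputs, n_samples=20000, seed=0):
    """Estimate total-order Sobol indices with Jansen's estimator.

    Inputs are assumed independent and uniform on [0, 1] (an assumption
    made for this sketch only).
    """
    rng = random.Random(seed)
    # Two independent sample matrices A and B.
    A = [[rng.random() for _ in range(n_inputs)] for _ in range(n_samples)]
    B = [[rng.random() for _ in range(n_inputs)] for _ in range(n_samples)]
    y_a = [model(row) for row in A]
    mean = sum(y_a) / n_samples
    var = sum((y - mean) ** 2 for y in y_a) / (n_samples - 1)
    indices = []
    for i in range(n_inputs):
        acc = 0.0
        for a_row, b_row, y in zip(A, B, y_a):
            # Replace only input i in the A-row by its value from the B-row.
            mixed = list(a_row)
            mixed[i] = b_row[i]
            acc += (y - model(mixed)) ** 2
        indices.append(acc / (2.0 * n_samples * var))
    return indices


def toy_model(x):
    """Hypothetical stand-in for an RTM: the third input has no effect."""
    return x[0] + 2.0 * x[1]


ST = total_sobol_indices(toy_model, n_inputs=3)
print([round(s, 3) for s in ST])
```

For this additive toy function the analytical total-order indices are 0.2, 0.8 and 0, so the estimates should land close to those values; an input with an index near zero would be a poor candidate for retrieval.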
Seasonal Mapping of Irrigated Winter Wheat Traits in Argentina with a Hybrid Retrieval Workflow Using Sentinel-2 Imagery
Earth observation offers an unprecedented opportunity to monitor intensively cultivated areas providing key support to assess fertilizer needs and crop water uptake. Routinely, vegetation traits mapping can help farmers to monitor plant development along the crop’s phenological cycle, which is particularly relevant for irrigated agricultural areas. The high spatial and temporal resolution of the Sentinel-2 (S2) multispectral instrument leverages the possibility to estimate leaf area index (LAI), canopy chlorophyll content (CCC), and vegetation water content (VWC) from space. Therefore, our study presents a hybrid retrieval workflow combining a physically-based strategy with a machine learni…
Assessing Non-Photosynthetic Cropland Biomass from Spaceborne Hyperspectral Imagery
Non-photosynthetic vegetation (NPV) biomass has been identified as a priority variable for upcoming spaceborne imaging spectroscopy missions, calling for a quantitative estimation of lignocellulosic plant material as opposed to the sole indication of surface coverage. Therefore, we propose a hybrid model for the retrieval of non-photosynthetic cropland biomass. The workflow included coupling the leaf optical model PROSPECT-PRO with the canopy reflectance model 4SAIL, which allowed us to simulate NPV biomass from carbon-based constituents (CBC) and leaf area index (LAI). PROSAIL-PRO provided a training database for a Gaussian process regression (GPR) algorithm, simulating a wide range of non…
Prototyping Crop Traits Retrieval Models for CHIME: Dimensionality Reduction Strategies Applied to PRISMA Data
In preparation for new-generation imaging spectrometer missions and the accompanying unprecedented inflow of hyperspectral data, optimized models are needed to generate vegetation traits routinely. Hybrid models, combining radiative transfer models with machine learning algorithms, are preferred; however, dealing with spectral collinearity imposes an additional challenge. In this study, we analyzed two spectral dimensionality reduction methods: principal component analysis (PCA) and band ranking (BR), embedded in a hybrid workflow for the retrieval of specific leaf area (SLA), leaf area index (LAI), canopy water content (CWC), canopy chlorophyll content (CCC), the fraction of absorbed photo…
Hybrid retrieval of crop traits from multi-temporal PRISMA hyperspectral imagery
The recently launched and upcoming hyperspectral satellite missions, featuring contiguous visible-to-shortwave infrared spectral information, are opening unprecedented opportunities for the retrieval of a broad set of vegetation traits with enhanced accuracy through novel retrieval schemes. In this framework, we exploited hyperspectral data cubes collected by the new-generation PRecursore IperSpettrale della Missione Applicativa (PRISMA) satellite of the Italian Space Agency to develop and test a hybrid retrieval workflow for crop trait mapping. Crop traits were mapped over an agricultural area in north-east Italy (Jolanda di Savoia, FE) using PRISMA images collected during the 2020 and 202…
Systematic Assessment of MODTRAN Emulators for Atmospheric Correction
Atmospheric radiative transfer models (RTMs) simulate the light propagation in the Earth's atmosphere. With the evolution of RTMs, their increase in complexity makes them impractical in routine processing such as atmospheric correction. To overcome their computational burden, standard practice is to interpolate a multidimensional lookup table (LUT) of prestored simulations. However, accurate interpolation relies on large LUTs, which still implies large computation times for their generation and interpolation. In recent years, emulation has been proposed as an alternative to LUT interpolation. Emulation approximates the RTM outputs by a statistical regression model trained with a low number …
Gaussian processes retrieval of crop traits in Google Earth Engine based on Sentinel-2 top-of-atmosphere data.
The unprecedented availability of optical satellite data in cloud-based computing platforms, such as Google Earth Engine (GEE), opens new possibilities to develop crop trait retrieval models from the local to the planetary scale. Hybrid retrieval models are of interest to run in these platforms as they combine the advantages of physically-based radiative transfer models (RTM) with the flexibility of machine learning regression algorithms. Previous research with GEE primarily relied on processing bottom-of-atmosphere (BOA) reflectance data, which requires atmospheric correction. In the present study, we implemented hybrid models directly into GEE for processing Sentinel-2 (S2) Level-1C (L1C)…
Prototyping Sentinel-2 green LAI and brown LAI products for cropland monitoring.
For agricultural applications, identification of non-photosynthetic above-ground vegetation is of great interest as it helps to assess harvest practices, detect crop residues or drought events, and better predict the carbon, water and nutrient uptake. While the mapping of green Leaf Area Index (LAI) is well established, current operational retrieval models are not calibrated for LAI estimation over senescent, brown vegetation. This not only leads to an underestimation of LAI when crops are ripening, but is also a missed monitoring opportunity. The high spatial and temporal resolution of the Sentinel-2 (S2) satellite constellation offers the possibility to estimate …
Introducing ARTMO's Machine-Learning Classification Algorithms Toolbox: Application to Plant-Type Detection in a Semi-Steppe Iranian Landscape.
Accurate plant-type (PT) detection forms an important basis for sustainable land management maintaining biodiversity and ecosystem services. In this sense, Sentinel-2 satellite images of the Copernicus program offer spatial, spectral, temporal, and radiometric characteristics with great potential for mapping and monitoring PTs. In addition, the selection of a best-performing algorithm needs to be considered for obtaining a PT classification that is as accurate as possible. To date, no freely downloadable toolbox exists that brings the diversity of the latest supervised machine-learning classification algorithms (MLCAs) together into a single intuitive user-friendly graphical user interface (GUI). To…
Quantifying the Robustness of Vegetation Indices through Global Sensitivity Analysis of Homogeneous and Forest Leaf-Canopy Radiative Transfer Models
Vegetation indices (VIs) are widely used in optical remote sensing to estimate biophysical variables of vegetated surfaces. With the advent of spectroscopy technology, spectral bands can be combined in numerous ways to extract the desired information. This resulted in a plethora of proposed indices, designed for a diversity of applications and research purposes. However, it is not always clear whether they are sensitive to the variable of interest while at the same time remaining insensitive to confounding factors. Hence, to be able to quantify the robustness of VIs, a systematic evaluation is needed, thereby introducing the widest possible variety of biochemical and structural heterogeneit…
Quantifying Fundamental Vegetation Traits over Europe Using the Sentinel-3 OLCI Catalogue in Google Earth Engine
Thanks to the emergence of cloud-computing platforms and the ability of machine learning methods to solve prediction problems efficiently, this work presents a workflow to automate spatiotemporal mapping of essential vegetation traits from Sentinel-3 (S3) imagery. The traits included leaf chlorophyll content (LCC), leaf area index (LAI), fraction of absorbed photosynthetically active radiation (FAPAR), and fractional vegetation cover (FVC), being fundamental for assessing photosynthetic activity on Earth. The workflow involved Gaussian process regression (GPR) algorithms trained on top-of-atmosphere (TOA) radiance simulations generated by the coupled canopy radiative transfer model (RTM) SC…
FLEX/S3 Tandem Mission Performance Assessment: Evolution of the End-to-End Simulator Flex-E
An end-to-end simulator (E2ES) is a tool to evaluate the performance of a satellite mission. Once a mission is approved for operation, the E2ES evolves during Phase C/D to become a supporting tool for the development and validation of the ground data processor, as well as for simulating data sets to test the Prototype and Operational Processors. FLEX-E is the E2ES of the FLEX/Sentinel-3 tandem mission, which was selected in 2015 as ESA's eighth Earth Explorer. The FLEX-E evolution implies the consolidation of all the retrieval algorithms (e.g. fluorescence, reflectance, biophysical variables), the implementation of new scientific developments, as well as the improvement of the co-registration proc…
Design of a Generic 3-D Scene Generator for Passive Optical Missions and Its Implementation for the ESA’s FLEX/Sentinel-3 Tandem Mission
During the design phase of a satellite mission, end-to-end mission performance simulator (E2ES) tools allow scientists and engineers to evaluate the mission concept, consolidate system technical requirements, and analyze the suitability of the implemented technical solutions and data processing algorithms. The generation of synthetic scenes is one of the core parts of an E2ES, providing scenes (ground truth) as would be observed by satellite instruments and used as reference against simulated retrieved mission products. An appropriate generation of the scene also allows assessing the performance of the ground data processing chain replacing real instrument data before the mission is in or…
Gaussian Processes Retrieval of LAI from Sentinel-2 Top-of-Atmosphere Radiance Data
Retrieval of vegetation properties from satellite and airborne optical data usually takes place after atmospheric correction, yet it is also possible to develop retrieval algorithms directly from top-of-atmosphere (TOA) radiance data. One of the key vegetation variables that can be retrieved from at-sensor TOA radiance data is leaf area index (LAI), provided that algorithms account for variability in the atmosphere. We demonstrate the feasibility of LAI retrieval from Sentinel-2 (S2) TOA radiance data (L1C product) in a hybrid machine learning framework. To achieve this, the coupled leaf-canopy-atmosphere radiative transfer models PROSAIL-6SV were used to simulate a look-up table (LUT) of TOA radi…
Statistical Learning for End-to-End Simulations
End-to-end mission performance simulators (E2ES) are suitable tools to accelerate satellite mission development from concept to deployment. One core element of these E2ES is the generation of synthetic scenes that are observed by the various instruments of an Earth Observation mission. The generation of these scenes relies on Radiative Transfer Models (RTM) for the simulation of light interaction with the Earth surface and atmosphere. However, the execution of advanced RTMs is impractical due to their large computational burden. Classical interpolation and statistical emulation methods of pre-computed Look-Up Tables (LUT) are therefore common practice to generate synthetic scenes in a reasonable…
Top-of-Atmosphere Retrieval of Multiple Crop Traits Using Variational Heteroscedastic Gaussian Processes within a Hybrid Workflow.
In support of cropland monitoring, operational Copernicus Sentinel-2 (S2) data became available globally and can be explored for the retrieval of important crop traits. Based on a hybrid workflow, retrieval models for six essential biochemical and biophysical crop traits were developed for both S2 bottom-of-atmosphere (BOA) L2A and S2 top-of-atmosphere (TOA) L1C data. A variational heteroscedastic Gaussian process regression (VHGPR) algorithm was trained with simulations generated by the combined leaf-canopy reflectance model PROSAIL at the BOA scale and further combined with the Second Simulation of a Satellite Signal in the Solar Spectrum (6SV) atmosphere model at the TOA scale. Establishe…
Optimized and automated estimation of vegetation properties: Opportunities for Sentinel-2
The biosphere is one of the main systems that make up the Earth. Studying it helps us understand the relationship between vegetation and the carbon cycle, and how that cycle can be affected by changes in CO2 levels and land use. To study these dynamics at global and local scales, various models have been developed that represent reality at a simpler scale and level of complexity. Some of the input variables of these models are obtained from remote sensing measurements thanks to the Global Climate Observing System (GCOS), which has defined a set of 50 essential climate variables that contribute to climate change studies…
Retrieving and Validating Leaf and Canopy Chlorophyll Content at Moderate Resolution: A Multiscale Analysis with the Sentinel-3 OLCI Sensor
ESA’s Eighth Earth Explorer mission “FLuorescence EXplorer” (FLEX) will be dedicated to the global monitoring of the chlorophyll fluorescence emitted by vegetation. In order to properly interpret the measured fluorescence signal, essential vegetation variables need to be retrieved concomitantly. FLEX will fly in tandem formation with Sentinel-3 (S3), which conveys the Ocean and Land Color Instrument (OLCI) that is designed to characterize the atmosphere and the terrestrial vegetation at a spatial resolution of 300 m. In support of FLEX’s preparatory activities, this paper presents a first validation exercise of OLCI vegetation products against in situ data coming from the 2018 FLEXSense cam…
Mapping landscape canopy nitrogen content from space using PRISMA data
Satellite imaging spectroscopy for terrestrial applications is reaching maturity with recently launched and upcoming science-driven missions, e.g. PRecursore IperSpettrale della Missione Applicativa (PRISMA) and Environmental Mapping and Analysis Program (EnMAP), respectively. Moreover, the high-priority mission candidate Copernicus Hyperspectral Imaging Mission for the Environment (CHIME) is expected to globally provide routine hyperspectral observations to support new and enhanced services for, among others, sustainable agricultural and biodiversity management. Thanks to the provision of contiguous visible-to-shortwave infrared spectral data, hyperspectral missions open enhanced …
Emulation of Sun-Induced Fluorescence from Radiance Data Recorded by the HyPlant Airborne Imaging Spectrometer
The retrieval of sun-induced fluorescence (SIF) from hyperspectral radiance data grew to maturity with research activities around the FLuorescence EXplorer satellite mission FLEX, yet full-spectrum estimation methods such as the spectral fitting method (SFM) are computationally expensive. To bypass this computational load, this work aims to approximate the SFM-based SIF retrieval by means of statistical learning, i.e., emulation. While emulators emerged as fast surrogate models of simulators, the accuracy-speedup trade-offs are still to be analyzed when the emulation concept is applied to experimental data. We evaluated the possibility of approximating the SFM-like SIF output directly based…
Intelligent Sampling for Vegetation Nitrogen Mapping Based on Hybrid Machine Learning Algorithms
Upcoming satellite imaging spectroscopy missions will deliver spatiotemporally explicit data streams to be exploited for mapping vegetation properties, such as nitrogen (N) content. Within retrieval workflows for real-time mapping over agricultural regions, such crop-specific information products need to be derived precisely and rapidly. To allow fast processing, intelligent sampling schemes for training databases should be incorporated to establish efficient machine learning (ML) models. In this study, we implemented active learning (AL) heuristics using kernel ridge regression (KRR) to minimize and optimize a training database for variational heteroscedastic Gaussian processes regression (V…
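To illustrate what an intelligent sampling scheme for a training database can look like, the sketch below implements one generic diversity-based heuristic: greedy farthest-point selection from a candidate pool. This is an assumption chosen for illustration; the abstract does not state which AL heuristics were used, and the KRR-based evaluation step of the study is not reproduced here. The function name `select_diverse` and the toy pool are hypothetical.

```python
import math


def select_diverse(pool, start_index, n_new):
    """Greedy diversity sampling: repeatedly add the pool sample that is
    farthest (Euclidean distance) from all samples selected so far."""
    selected = [start_index]
    for _ in range(n_new):
        candidates = [i for i in range(len(pool)) if i not in selected]

        def distance_to_selected(i):
            # Distance from candidate i to its nearest selected sample.
            return min(math.dist(pool[i], pool[j]) for j in selected)

        selected.append(max(candidates, key=distance_to_selected))
    return selected


# Hypothetical pool of simulated samples in a 2-D feature space.
pool = [(0.0, 0.0), (0.1, 0.0), (1.0, 0.0), (0.0, 0.9), (1.0, 1.0), (0.5, 0.5)]
chosen = select_diverse(pool, start_index=0, n_new=3)
print(chosen)  # [0, 4, 2, 3]
```

The near-duplicate sample at index 1 is never chosen, which is the point of such a heuristic: redundant samples add training cost without adding information.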
A Survey of Active Learning for Quantifying Vegetation Traits from Terrestrial Earth Observation Data
The current exponential increase of spatiotemporally explicit data streams from satellite-based Earth observation missions offers promising opportunities for global vegetation monitoring. Intelligent sampling through active learning (AL) heuristics provides a pathway for fast inference of essential vegetation variables by means of hybrid retrieval approaches, i.e., machine learning regression algorithms trained by radiative transfer model (RTM) simulations. In this study we summarize AL theory and perform a brief systematic literature survey about AL heuristics used in the context of Earth observation regression problems over terrestrial targets. Across all relevant studies it appeared that…
Hyperspectral dimensionality reduction for biophysical variable statistical retrieval
Current and upcoming airborne and spaceborne imaging spectrometers lead to vast hyperspectral data streams. This scenario calls for automated and optimized spectral dimensionality reduction techniques to enable fast and efficient hyperspectral data processing, such as inferring vegetation properties. In preparation for next-generation biophysical variable retrieval methods applicable to hyperspectral data, we present the evaluation of 11 dimensionality reduction (DR) methods in combination with advanced machine learning regression algorithms (MLRAs) for statistical variable retrieval. Two unique hyperspectral datasets were analyzed on the predictive power of DR + MLRA methods to ret…
Emulation as an Accurate Alternative to Interpolation in Sampling Radiative Transfer Codes
Computationally expensive radiative transfer models (RTMs) are widely used to realistically reproduce the light interaction with the Earth's surface and atmosphere. Because these models take a long processing time, the common practice is to first generate a sparse look-up table (LUT) and then make use of interpolation methods to sample the multidimensional LUT input variable space. However, the question arises whether common interpolation methods are the most accurate option. As an alternative to interpolation, this paper proposes to use emulation, i.e., approximating the RTM output by means of statistical learning. Two experiments were conducted to assess the accuracy in delivering spectral outputs…
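The contrast between LUT interpolation and emulation can be sketched on a one-dimensional toy problem. In the example below, `toy_rtm` is a hypothetical stand-in for an expensive RTM, the interpolator is piecewise linear, and the emulator is a kernel ridge regression with a Gaussian kernel. The choice of regression method, kernel length scale and ridge value are assumptions for this sketch and are not taken from the paper. The resulting error values only describe this toy setup.

```python
import math


def toy_rtm(x):
    """Hypothetical stand-in for an expensive radiative transfer model."""
    return math.exp(-x) * math.sin(3.0 * x)


def linear_interp(nodes, values, x):
    """Piecewise-linear interpolation in a sorted 1-D LUT."""
    for j in range(len(nodes) - 1):
        if nodes[j] <= x <= nodes[j + 1]:
            w = (x - nodes[j]) / (nodes[j + 1] - nodes[j])
            return (1.0 - w) * values[j] + w * values[j + 1]
    raise ValueError("x lies outside the LUT range")


def solve(matrix, rhs):
    """Solve a small linear system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def rbf(a, b, length=0.3):
    """Gaussian (RBF) kernel; the length scale is an assumed value."""
    return math.exp(-((a - b) ** 2) / (2.0 * length ** 2))


def fit_emulator(nodes, values, ridge=1e-8):
    """Kernel ridge regression trained on the LUT nodes."""
    K = [[rbf(a, b) + (ridge if i == j else 0.0) for j, b in enumerate(nodes)]
         for i, a in enumerate(nodes)]
    alpha = solve(K, values)
    return lambda x: sum(w * rbf(x, node) for w, node in zip(alpha, nodes))


# Sparse LUT: 9 model runs on [0, 2].
nodes = [0.25 * i for i in range(9)]
values = [toy_rtm(x) for x in nodes]
emulator = fit_emulator(nodes, values)

# Dense reference grid on which both approximations are compared.
test_points = [2.0 * i / 200 for i in range(201)]


def rmse(predict):
    errors = [(predict(x) - toy_rtm(x)) ** 2 for x in test_points]
    return math.sqrt(sum(errors) / len(errors))


rmse_interp = rmse(lambda x: linear_interp(nodes, values, x))
rmse_emul = rmse(emulator)
print(f"interpolation RMSE: {rmse_interp:.4f}, emulation RMSE: {rmse_emul:.4f}")
```

Both approaches reuse the same nine model runs; they differ only in how values between the LUT nodes are approximated.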
Improving the remote estimation of soil organic carbon in complex ecosystems with Sentinel-2 and GIS using Gaussian processes regression
Background and aims: The quantitative retrieval of soil organic carbon (SOC) storage, particularly for soils with a large potential for carbon sequestration, is of global interest due to its link with the carbon cycle and the mitigation of climate change. However, complex ecosystems with good soil qualities for SOC storage are poorly studied. Methods: The interrelation between SOC and various vegetation remote sensing drivers is understood to demonstrate the link between the carbon stored in the vegetation layer and SOC of the top soil layers. Based on the mapping of SOC in two horizons (0–30 cm and 30–60 cm) we predict SOC with high accuracy in the complex and mountainous heterogene…
DATimeS: A machine learning time series GUI toolbox for gap-filling and vegetation phenology trends detection
Optical remotely sensed data are typically discontinuous, with missing values due to cloud cover. Consequently, gap-filling solutions are needed for accurate crop phenology characterization. The Decomposition and Analysis of Time Series software (DATimeS) presented here expands established time series interpolation methods with a diversity of advanced machine learning fitting algorithms (e.g., Gaussian process regression, GPR) that are particularly effective for reconstructing multi-season vegetation temporal patterns. DATimeS is freely available as a powerful image time series software that generates cloud-free composite maps and captures seasonal vegetation dynamics from regula…
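As a point of reference for the gap-filling problem described above, the sketch below shows only the simplest baseline, linear interpolation across cloud gaps in a single-pixel time series. It is not DATimeS code, and the function name `fill_gaps_linear` and the sample values are hypothetical. The machine learning fitting methods named in the abstract replace this straight-line fit with a learned curve.

```python
def fill_gaps_linear(days, values):
    """Fill None entries by linear interpolation between the nearest valid
    neighbours. Leading or trailing gaps stay None, because there is no
    second observation to interpolate towards."""
    filled = list(values)
    valid = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(valid, valid[1:]):
        span = days[right] - days[left]
        for i in range(left + 1, right):
            weight = (days[i] - days[left]) / span
            filled[i] = (1.0 - weight) * values[left] + weight * values[right]
    return filled


# Hypothetical LAI observations; None marks cloud-covered acquisitions.
days = [0, 5, 10, 15, 20, 25]
lai = [0.5, None, 1.5, None, None, 3.0]
filled = fill_gaps_linear(days, lai)
print(filled)
```

Using acquisition days rather than list positions as the interpolation axis keeps the result correct when the revisit interval is irregular.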
Quantifying vegetation biophysical variables from the Sentinel-3/FLEX tandem mission: Evaluation of the synergy of OLCI and FLORIS data sources
The ESA’s forthcoming FLuorescence EXplorer (FLEX) mission is dedicated to the global monitoring of the vegetation’s chlorophyll fluorescence by means of an imaging spectrometer, FLORIS. In order to properly interpret the fluorescence signal in relation to photosynthetic activity, essential vegetation variables need to be retrieved concomitantly. FLEX will fly in tandem with Sentinel-3 (S3), which conveys the Ocean and Land Colour Instrument (OLCI) that is designed to characterize the atmosphere and the terrestrial vegetation at a spatial resolution of 300 m. In this work we present the retrieval models of four essential biophysical variables: (1) Leaf Area Index (LAI), (2) leaf chlorophyll…
Evaluation of Hybrid Models to Estimate Chlorophyll and Nitrogen Content of Maize Crops in the Framework of the Future CHIME Mission
In the next few years, the new Copernicus Hyperspectral Imaging Mission (CHIME) is foreseen to be launched by the European Space Agency (ESA). This mission will provide an unprecedented amount of hyperspectral data, enabling new research possibilities within several fields of natural resources, including the “agriculture and food security” domain. In order to efficiently exploit this upcoming hyperspectral data stream, new processing methods and techniques need to be studied and implemented. In this work, the hybrid approach (HYB) and its variant, featuring sampling dimensionality reduction through active learning heuristics (HAL), were applied to CHIME-like data to evaluate the…