0000000000019053
AUTHOR
Jose E. Adsuara
Physics-Aware Machine Learning For Geosciences And Remote Sensing
Machine learning models alone are excellent approximators, but very often do not respect the most elementary laws of physics, like mass or energy conservation, so consistency and confidence are compromised. In this paper we describe the main challenges ahead in the field, and introduce several ways to live in the Physics and machine learning interplay: encoding differential equations from data, constraining data-driven models with physics-priors and dependence constraints, improving parameterizations, emulating physical models, and blending data-driven and process-based models. This is a collective long-term AI agenda towards developing and applying algorithms capable of discovering knowled…
Down-Scaling Modis Vegetation Products with Landsat GAP Filled Surface Reflectance in Google Earth Engine
High spatial resolution vegetation products are fundamental in different fields, such as improving the understanding of crop seasonality at regional scales. Here, two new vegetation products such as the Leaf Area Index (LAI) and the Fraction of Absorbed Photosynthetically Active Radiation (FAPAR) are downscaled at continental scales. A novel HIghly Scalable Temporal Adaptive Reflectance Fusion Model (HIS-TARFM) is used to generate the gap-free time series of Landsat surface reflectance data by fusing MODIS and Landsat reflectance for the contiguous United States. An artificial neural network is trained to capture the relationship between the gap free Landsat surface reflectance and the MODI…
Nonlinear Distribution Regression for Remote Sensing Applications
In many remote sensing applications, one wants to estimate variables or parameters of interest from observations. When the target variable is available at a resolution that matches the remote sensing observations, standard algorithms, such as neural networks, random forests, or the Gaussian processes, are readily available to relate the two. However, we often encounter situations where the target variable is only available at the group level, i.e., collectively associated with a number of remotely sensed observations. This problem setting is known in statistics and machine learning as multiple instance learning (MIL) or distribution regression (DR). This article introduces a nonlinear (kern…
Inferring causal relations from observational long-term carbon and water fluxes records
AbstractLand, atmosphere and climate interact constantly and at different spatial and temporal scales. In this paper we rely on causal discovery methods to infer spatial patterns of causal relations between several key variables of the carbon and water cycles: gross primary productivity, latent heat energy flux for evaporation, surface air temperature, precipitation, soil moisture and radiation. We introduce a methodology based on the convergent cross-mapping (CCM) technique. Despite its good performance in general, CCM is sensitive to (even moderate) noise levels and hyper-parameter selection. We present a robust CCM (RCCM) that relies on temporal bootstrapping decision scores and the deri…
Estimation of the mechanical properties of the eye through the study of its vibrational modes.
Measuring the eye's mechanical properties in vivo and with minimally invasive techniques can be the key for individualized solutions to a number of eye pathologies. The development of such techniques largely relies on a computational modelling of the eyeball and, it optimally requires the synergic interplay between experimentation and numerical simulation. In Astrophysics and Geophysics the remote measurement of structural properties of the systems of their realm is performed on the basis of (helio-)seismic techniques. As a biomechanical system, the eyeball possesses normal vibrational modes encompassing rich information about its structure and mechanical properties. However, the integral a…
Discovering Differential Equations from Earth Observation Data
Modeling and understanding the Earth system is a constant and challenging scientific endeavour. When a clear mechanistic model is unavailable, complex or uncertain, learning from data can be an alternative. While machine learning has provided excellent methods for detection and retrieval, understanding the governing equations of the system from observational data seems an elusive problem. In this paper we introduce sparse regression to uncover a set of governing equations in the form of a system of ordinary differential equations (ODEs). The presented method is used to explicitly describe variable relations by identifying the most expressive and simplest ODEs explaining data to model releva…
Effect of contact lenses on ocular biometric measurements based on swept-source optical coherence tomography
ABSTRACT Purpose: To determine the reliability of swept- source optical coherence tomography in cases in which soft contact lenses cannot be removed when acquiring biometric measurements. Methods: Eight subjects were included and only one eye per participant was analyzed. Each eye was measured six times by swept-source optical coherence tomography with the IOLMaster 700 instrument (Carl Zeiss Meditec, Jena, Germany). Axial length, central corneal thickness, anterior chamber depth, lens thickness, and keratometric measurements were evaluated for the naked eye and while wearing soft contact lenses of three different powers (-1.5, -3.0, and +2.0 D). Results: There were statistically significan…
Learning main drivers of crop progress and failure in Europe with interpretable machine learning
Abstract A wide variety of methods exist nowadays to address the important problem of estimating crop yields from available remote sensing and climate data. Among the different approaches, machine learning (ML) techniques are being increasingly adopted, since they allow exploiting all the information on crop progress and environmental conditions and their relations with crop yield, achieving reliable and accurate estimations. However, interpreting the relationships learned by the ML models, and hence getting insights about the problem, remains a complex and usually unexplored task. Without accountability, confidence and trust in the ML models can be compromised. Here, we develop interpretab…
Interpretability of Recurrent Neural Networks in Remote Sensing
In this work we propose the use of Long Short-Term Memory (LSTM) Recurrent Neural Networks for multivariate time series of satellite data for crop yield estimation. Recurrent nets allow exploiting the temporal dimension efficiently, but interpretability is hampered by the typically overparameterized models. The focus of the study is to understand LSTM models by looking at the hidden units distribution, the impact of increasing network complexity, and the relative importance of the input covariates. We extracted time series of three variables describing the soil-vegetation status in agroe-cosystems -soil moisture, VOD and EVI- from optical and microwave satellites, as well as available in si…
On the equivalence between the Scheduled Relaxation Jacobi method and Richardson's non-stationary method
The Scheduled Relaxation Jacobi (SRJ) method is an extension of the classical Jacobi iterative method to solve linear systems of equations ($Au=b$) associated with elliptic problems. It inherits its robustness and accelerates its convergence rate computing a set of $P$ relaxation factors that result from a minimization problem. In a typical SRJ scheme, the former set of factors is employed in cycles of $M$ consecutive iterations until a prescribed tolerance is reached. We present the analytic form for the optimal set of relaxation factors for the case in which all of them are different, and find that the resulting algorithm is equivalent to a non-stationary generalized Richardson's method. …
Global Upscaling of the MODIS Land Cover with Google Earth Engine and Landsat Data
Image classification has become one of the most common applications in remote sensing yielding to the creation of a variety of operational thematic maps at multiple spatio-temporal scales. The information contained in these maps summarizes key characteristics related with the physical environment and provides fundamental information of the Earth for vegetation monitoring or land use status over time. However, high spatial resolution land cover maps are usually only produced for specific small regions or in an image tile. We present a general methodology to obtain a high spatial resolution land cover maps using Landsat spectral information, the powerful Google Earth Engine platform, and oper…
Convolutional Long Short-Term Memory Network for Multitemporal Cloud Detection Over Landmarks
In this work, we propose to exploit both the temporal and spatial correlations in Earth observation satellite images through deep learning methods. In particular, the combination of a U-Net convolutional neural network together with a convolutional long short-term memory (LSTM) layer is proposed. This model is applied for cloud detection on MSG/SEVIRI image time series over selected landmarks. Implementation details are provided and our proposal is compared against a standard SVM and a U-Net without the convolutional LSTM layer but including temporal information too. Experimental results show that this combination of networks exploits both the spatial and temporal dependence and provides st…
Synergistic integration of optical and microwave satellite data for crop yield estimation
Developing accurate models of crop stress, phenology and productivity is of paramount importance, given the increasing need of food. Earth observation (EO) remote sensing data provides a unique source of information to monitor crops in a temporally resolved and spatially explicit way. In this study, we propose the combination of multisensor (optical and microwave) remote sensing data for crop yield estimation and forecasting using two novel approaches. We first propose the lag between Enhanced Vegetation Index (EVI) derived from MODIS and Vegetation Optical Depth (VOD) derived from SMAP as a new joint metric combining the information from the two satellite sensors in a unique feature or des…
Machine Learning Methods for Spatial and Temporal Parameter Estimation
Monitoring vegetation with satellite remote sensing is of paramount relevance to understand the status and health of our planet. Accurate and constant monitoring of the biosphere has large societal, economical, and environmental implications, given the increasing demand of biofuels and food by the world population. The current democratization of machine learning, big data, and high processing capabilities allow us to take such endeavor in a decisive manner. This chapter proposes three novel machine learning approaches to exploit spatial, temporal, multi-sensor, and large-scale data characteristics. We show (1) the application of multi-output Gaussian processes for gap-filling time series of…