0000000000158291

AUTHOR

Adrian Perez-suay

showing 32 related works from this author

Physics-Aware Machine Learning For Geosciences And Remote Sensing

2021

Machine learning models alone are excellent approximators, but very often do not respect the most elementary laws of physics, like mass or energy conservation, so consistency and confidence are compromised. In this paper we describe the main challenges ahead in the field, and introduce several ways to live in the Physics and machine learning interplay: encoding differential equations from data, constraining data-driven models with physics-priors and dependence constraints, improving parameterizations, emulating physical models, and blending data-driven and process-based models. This is a collective long-term AI agenda towards developing and applying algorithms capable of discovering knowled…

Physics010504 meteorology & atmospheric sciencesMathematical modelbusiness.industry0211 other engineering and technologies02 engineering and technologyMachine learningcomputer.software_genre01 natural sciencesField (computer science)Data modelingEnergy conservationEarth system scienceConsistency (database systems)Encoding (memory)Artificial intelligencebusinesscomputerGeology021101 geological & geomatics engineering0105 earth and related environmental sciencesPhysical lawIGARSS 2021 - 2021 IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium 2021
researchProduct

Neural Network Emulation of Synthetic Hyperspectral Sentinel-2-Like Imagery With Uncertainty

2023

Hyperspectral satellite imagery provides highly-resolved spectral information for large areas and can provide vital information. However, only a few imaging spectrometer missions are currently in operation. Aiming to generate synthetic satellite-based hyperspectral imagery potentially covering any region, we explored the possibility of applying statistical learning, i.e. emulation. Based on the relationship of a Sentinel-2 (S2) scene and a hyperspectral HyPlant airborne image, this work demonstrates the possibility to emulate a hyperspectral S2-like image. We tested the role of different machine learning regression algorithms (MLRA) and varied the image-extracted training dataset size. We f…

Atmospheric Scienceddc:520Computers in Earth SciencesIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
researchProduct

Nonlinear Distribution Regression for Remote Sensing Applications

2020

In many remote sensing applications, one wants to estimate variables or parameters of interest from observations. When the target variable is available at a resolution that matches the remote sensing observations, standard algorithms, such as neural networks, random forests, or the Gaussian processes, are readily available to relate the two. However, we often encounter situations where the target variable is only available at the group level, i.e., collectively associated with a number of remotely sensed observations. This problem setting is known in statistics and machine learning as multiple instance learning (MIL) or distribution regression (DR). This article introduces a nonlinear (kern…

Signal Processing (eess.SP)FOS: Computer and information sciencesComputer Science - Machine LearningArtificial neural networkRemote sensing applicationComputer science0211 other engineering and technologies02 engineering and technologyLeast squaresRandom forestMachine Learning (cs.LG)Kernel (linear algebra)symbols.namesakeKernel (statistics)symbolsFOS: Electrical engineering electronic engineering information engineeringGeneral Earth and Planetary SciencesElectrical Engineering and Systems Science - Signal ProcessingElectrical and Electronic EngineeringGaussian processAlgorithm021101 geological & geomatics engineeringCurse of dimensionalityIEEE Transactions on Geoscience and Remote Sensing
researchProduct

Causal Inference in Geoscience and Remote Sensing From Observational Data

2020

Establishing causal relations between random variables from observational data is perhaps the most important challenge in today’s science. In remote sensing and geosciences, this is of special relevance to better understand the earth’s system and the complex interactions between the governing processes. In this paper, we focus on an observational causal inference, and thus, we try to estimate the correct direction of causation using a finite set of empirical data. In addition, we focus on the more complex bivariate scenario that requires strong assumptions and no conditional independence tests can be used. In particular, we explore the framework of (nondeterministic) additive noise models, …

Signal Processing (eess.SP)FOS: Computer and information sciencesComputer Science - Machine LearningEarth science0211 other engineering and technologiesEstimatorRegression analysis02 engineering and technologyBivariate analysisMachine Learning (cs.LG)Methodology (stat.ME)Nondeterministic algorithmConditional independence13. Climate actionCausal inferenceFOS: Electrical engineering electronic engineering information engineeringGeneral Earth and Planetary SciencesElectrical Engineering and Systems Science - Signal ProcessingElectrical and Electronic EngineeringSpurious relationshipStatistics - MethodologyIndependence (probability theory)021101 geological & geomatics engineeringRemote sensingIEEE Transactions on Geoscience and Remote Sensing
researchProduct

Nonlinear Cook distance for Anomalous Change Detection

2020

In this work we propose a method to find anomalous changes in remote sensing images based on the chronochrome approach. A regressor between images is used to discover the most {\em influential points} in the observed data. Typically, the pixels with largest residuals are decided to be anomalous changes. In order to find the anomalous pixels we consider the Cook distance and propose its nonlinear extension using random Fourier features as an efficient nonlinear measure of impact. Good empirical performance is shown over different multispectral images both visually and quantitatively evaluated with ROC curves.

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Multispectral imageComputer Science - Computer Vision and Pattern Recognition0211 other engineering and technologies02 engineering and technologyMeasure (mathematics)Machine Learning (cs.LG)Kernel (linear algebra)symbols.namesake0502 economics and businessCook's distance021101 geological & geomatics engineering050208 financePixelbusiness.industry05 social sciencesPattern recognitionNonlinear systemFourier transformKernel (image processing)Computer Science::Computer Vision and Pattern RecognitionsymbolsArtificial intelligencebusinessChange detection
researchProduct

Randomized kernels for large scale Earth observation applications

2020

Abstract Current remote sensing applications of bio-geophysical parameter estimation and image classification have to deal with an unprecedented big amount of heterogeneous and complex data sources. New satellite sensors involving a high number of improved time, space and wavelength resolutions give rise to challenging computational problems. Standard physical inversion techniques cannot cope efficiently with this new scenario. Dealing with land cover classification of the new image sources has also turned to be a complex problem requiring large amount of memory and processing time. In order to cope with these problems, statistical learning has greatly helped in the last years to develop st…

FOS: Computer and information sciencesEarth observationComputer Science - Machine Learning010504 meteorology & atmospheric sciencesComputer scienceRemote sensing application0211 other engineering and technologiesSoil Science02 engineering and technologycomputer.software_genre01 natural sciencesMachine Learning (cs.LG)Computers in Earth Sciences021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingContextual image classificationEstimation theoryHyperspectral imagingGeology15. Life on landKernel methodKernel regressionData miningComputational problemcomputerRemote Sensing of Environment
researchProduct

Pattern Recognition Scheme for Large-Scale Cloud Detection over Landmarks

2020

Landmark recognition and matching is a critical step in many Image Navigation and Registration (INR) models for geostationary satellite services, as well as to maintain the geometric quality assessment (GQA) in the instrument data processing chain of Earth observation satellites. Matching the landmark accurately is of paramount relevance, and the process can be strongly impacted by the cloud contamination of a given landmark. This paper introduces a complete pattern recognition methodology able to detect the presence of clouds over landmarks using Meteosat Second Generation (MSG) data. The methodology is based on the ensemble combination of dedicated support vector machines (SVMs) dependent…

FOS: Computer and information sciencesAtmospheric ScienceMatching (statistics)Computer Science - Machine LearningSource code010504 meteorology & atmospheric sciencesComputer scienceComputer Vision and Pattern Recognition (cs.CV)media_common.quotation_subjectMultispectral image0211 other engineering and technologiesComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern RecognitionCloud computing02 engineering and technology01 natural sciencesMachine Learning (cs.LG)Computers in Earth Sciences021101 geological & geomatics engineering0105 earth and related environmental sciencesmedia_commonLandmarkbusiness.industryPattern recognitionSupport vector machinePattern recognition (psychology)Geostationary orbitArtificial intelligencebusiness
researchProduct

About Combining Metric Learning and Prototype Generation

2014

Distance metric learning has been a major research topic in recent times. Usually, the problem is formulated as finding a Mahalanobis-like metric matrix that satisfies a set of constraints as much as possible. Different ways to introduce these constraints and to effectively formulate and solve the optimization problem have been proposed. In this work, we start with one of these formulations that leads to a convex optimization problem and generalize it in order to increase the efficiency by appropriately selecting the set of constraints. Moreover, the original criterion is expressed in terms of a reduced set of representatives that is learnt together with the metric. This leads to further im…

Set (abstract data type)Matrix (mathematics)Mathematical optimizationOptimization problemmedia_common.quotation_subjectMetric (mathematics)Convex optimizationQuality (business)Equivalence of metricsMathematicsMetric k-centermedia_common
researchProduct

Remote Sensing Image Classification with Large Scale Gaussian Processes

2017

Current remote sensing image classification problems have to deal with an unprecedented amount of heterogeneous and complex data sources. Upcoming missions will soon provide large data streams that will make land cover/use classification difficult. Machine learning classifiers can help at this, and many methods are currently available. A popular kernel classifier is the Gaussian process classifier (GPC), since it approaches the classification problem with a solid probabilistic treatment, thus yielding confidence intervals for the predictions as well as very competitive results to state-of-the-art neural networks and support vector machines. However, its computational cost is prohibitive for…

FOS: Computer and information sciences010504 meteorology & atmospheric sciencesComputer scienceMultispectral image0211 other engineering and technologiesMachine Learning (stat.ML)02 engineering and technologyLand cover01 natural sciencesStatistics - ApplicationsMachine Learning (cs.LG)Kernel (linear algebra)Bayes' theoremsymbols.namesakeStatistics - Machine LearningApplications (stat.AP)Electrical and Electronic EngineeringGaussian process021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingContextual image classificationArtificial neural networkData stream miningProbabilistic logicSupport vector machineComputer Science - LearningKernel (image processing)symbolsGeneral Earth and Planetary Sciences
researchProduct

Randomized Rx For Target Detection

2018

This work tackles the target detection problem through the well-known global RX method. The RX method models the clutter as a multivariate Gaussian distribution, and has been extended to nonlinear distributions using kernel methods. While the kernel RX can cope with complex clutters, it requires a considerable amount of computational resources as the number of clutter pixels gets larger. Here we propose random Fourier features to approximate the Gaussian kernel in kernel RX and consequently our development keep the accuracy of the nonlinearity while reducing the computational cost which is now controlled by an hyperparameter. Results over both synthetic and real-world image target detection…

FOS: Computer and information sciencesHyperparameter020301 aerospace & aeronauticsComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)0211 other engineering and technologiesComputer Science - Computer Vision and Pattern RecognitionMultivariate normal distribution02 engineering and technologyObject detectionMachine Learning (cs.LG)symbols.namesakeKernel (linear algebra)Kernel method0203 mechanical engineeringKernel (statistics)Gaussian functionsymbolsClutterAnomaly detectionAlgorithm021101 geological & geomatics engineering
researchProduct

Consistent Regression of Biophysical Parameters with Kernel Methods

2020

This paper introduces a novel statistical regression framework that allows the incorporation of consistency constraints. A linear and nonlinear (kernel-based) formulation are introduced, and both imply closed-form analytical solutions. The models exploit all the information from a set of drivers while being maximally independent of a set of auxiliary, protected variables. We successfully illustrate the performance in the estimation of chlorophyll content.

FOS: Computer and information sciencesMathematical optimizationComputer Science - Machine Learning010504 meteorology & atmospheric sciences0211 other engineering and technologiesRegression analysisMachine Learning (stat.ML)02 engineering and technology01 natural sciencesRegressionData modelingMachine Learning (cs.LG)Set (abstract data type)Methodology (stat.ME)Nonlinear systemKernel methodConsistency (statistics)Statistics - Machine LearningKernel (statistics)Statistics - Methodology021101 geological & geomatics engineering0105 earth and related environmental sciencesMathematicsIGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium
researchProduct

Causal inference in geosciences with kernel sensitivity maps

2020

Establishing causal relations between random variables from observational data is perhaps the most important challenge in today's Science. In remote sensing and geosciences this is of special relevance to better understand the Earth's system and the complex and elusive interactions between processes. In this paper we explore a framework to derive cause-effect relations from pairs of variables via regression and dependence estimation. We propose to focus on the sensitivity (curvature) of the dependence estimator to account for the asymmetry of the forward and inverse densities of approximation residuals. Results in a large collection of 28 geoscience causal inference problems demonstrate the…

Signal Processing (eess.SP)FOS: Computer and information sciencesComputer Science - Machine Learning010504 meteorology & atmospheric sciences0211 other engineering and technologiesInverseEstimator02 engineering and technologycomputer.software_genre01 natural sciencesMachine Learning (cs.LG)Methodology (stat.ME)Kernel (statistics)Causal inferenceFOS: Electrical engineering electronic engineering information engineeringRelevance (information retrieval)Data miningSensitivity (control systems)Electrical Engineering and Systems Science - Signal ProcessingFocus (optics)computerRandom variableStatistics - Methodology021101 geological & geomatics engineering0105 earth and related environmental sciences
researchProduct

Discovering Differential Equations from Earth Observation Data

2020

Modeling and understanding the Earth system is a constant and challenging scientific endeavour. When a clear mechanistic model is unavailable, complex or uncertain, learning from data can be an alternative. While machine learning has provided excellent methods for detection and retrieval, understanding the governing equations of the system from observational data seems an elusive problem. In this paper we introduce sparse regression to uncover a set of governing equations in the form of a system of ordinary differential equations (ODEs). The presented method is used to explicitly describe variable relations by identifying the most expressive and simplest ODEs explaining data to model releva…

0301 basic medicineEarth observationTheoretical computer scienceComputer scienceDifferential equationOde020206 networking & telecommunications02 engineering and technologyData modeling03 medical and health sciences030104 developmental biologyOrdinary differential equation0202 electrical engineering electronic engineering information engineeringConstant (mathematics)Variable (mathematics)IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium
researchProduct

Kernel methods and their derivatives: Concept and perspectives for the earth system sciences.

2020

Kernel methods are powerful machine learning techniques which implement generic non-linear functions to solve complex tasks in a simple way. They Have a solid mathematical background and exhibit excellent performance in practice. However, kernel machines are still considered black-box models as the feature mapping is not directly accessible and difficult to interpret.The aim of this work is to show that it is indeed possible to interpret the functions learned by various kernel methods is intuitive despite their complexity. Specifically, we show that derivatives of these functions have a simple mathematical formulation, are easy to compute, and can be applied to many different problems. We n…

FOS: Computer and information sciencesComputer Science - Machine LearningSupport Vector MachineTheoretical computer scienceComputer scienceEntropyKernel FunctionsNormal Distribution0211 other engineering and technologies02 engineering and technologyMachine Learning (cs.LG)Machine LearningStatistics - Machine LearningSimple (abstract algebra)0202 electrical engineering electronic engineering information engineeringOperator TheoryData ManagementMultidisciplinaryGeographyApplied MathematicsSimulation and ModelingQRDensity estimationKernel methodKernel (statistics)Physical SciencessymbolsMedicine020201 artificial intelligence & image processingAlgorithmsResearch ArticleComputer and Information SciencesScienceMachine Learning (stat.ML)Research and Analysis MethodsKernel MethodsKernel (linear algebra)symbols.namesakeArtificial IntelligenceSupport Vector MachinesHumansEntropy (information theory)Computer SimulationGaussian process021101 geological & geomatics engineeringData VisualizationCorrectionRandom VariablesFunction (mathematics)Probability TheorySupport vector machineAlgebraPhysical GeographyLinear AlgebraEarth SciencesEigenvectorsRandom variableMathematicsEarth SystemsPLoS ONE
researchProduct

Scaling Up a Metric Learning Algorithm for Image Recognition and Representation

2008

Maximally Collapsing Metric Learning is a recently proposed algorithm to estimate a metric matrix from labelled data. The purpose of this work is to extend this approach by considering a set of landmark points which can in principle reduce the cost per iteration in one order of magnitude. The proposal is in fact a generalized version of the original algorithm that can be applied to larger amounts of higher dimensional data. Exhaustive experimentation shows that very similar behavior at a lower cost is obtained for a wide range of the number of landmark points used.

Clustering high-dimensional dataSet (abstract data type)Range (mathematics)LandmarkMetric (mathematics)Landmark pointRepresentation (mathematics)AlgorithmFacial recognition systemMathematics
researchProduct

A Random Extension for Discriminative Dimensionality Reduction and Metric Learning

2009

A recently proposed metric learning algorithm which enforces the optimal discrimination of the different classes is extended and empirically assessed using different kinds of publicly available data. The optimization problem is posed in terms of landmark points and then, a stochastic approach is followed in order to bypass some of the problems of the original algorithm. According to the results, both computational burden and generalization ability are improved while absolute performance results remain almost unchanged.

LandmarkOptimization problemDiscriminative modelbusiness.industryGeneralizationPopulation-based incremental learningDimensionality reductionMetric (mathematics)Pattern recognitionExtension (predicate logic)Artificial intelligencebusinessMathematics
researchProduct

Fair Kernel Learning

2017

New social and economic activities massively exploit big data and machine learning algorithms to do inference on people’s lives. Applications include automatic curricula evaluation, wage determination, and risk assessment for credits and loans. Recently, many governments and institutions have raised concerns about the lack of fairness, equity and ethics in machine learning to treat these problems. It has been shown that not including sensitive features that bias fairness, such as gender or race, is not enough to mitigate the discrimination when other related features are included. Instead, including fairness in the objective function has been shown to be more efficient.

Equity (economics)Actuarial scienceComputingMilieux_THECOMPUTINGPROFESSIONExploitComputer sciencebusiness.industrymedia_common.quotation_subjectDimensionality reductionBig dataWageInference02 engineering and technologyKernel method020204 information systems0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingbusinessCurriculummedia_common
researchProduct

Interpretability of Recurrent Neural Networks in Remote Sensing

2020

In this work we propose the use of Long Short-Term Memory (LSTM) Recurrent Neural Networks for multivariate time series of satellite data for crop yield estimation. Recurrent nets allow exploiting the temporal dimension efficiently, but interpretability is hampered by the typically overparameterized models. The focus of the study is to understand LSTM models by looking at the hidden units distribution, the impact of increasing network complexity, and the relative importance of the input covariates. We extracted time series of three variables describing the soil-vegetation status in agroe-cosystems -soil moisture, VOD and EVI- from optical and microwave satellites, as well as available in si…

2. Zero hungerMultivariate statisticsNetwork complexity010504 meteorology & atmospheric sciencesComputer science0211 other engineering and technologies02 engineering and technology15. Life on landcomputer.software_genre01 natural sciencesRecurrent neural networkDimension (vector space)Redundancy (engineering)Relevance (information retrieval)Data miningTime seriesWater contentcomputer021101 geological & geomatics engineering0105 earth and related environmental sciencesInterpretabilityIGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium
researchProduct

Convolutional Long Short-Term Memory Network for Multitemporal Cloud Detection Over Landmarks

2019

In this work, we propose to exploit both the temporal and spatial correlations in Earth observation satellite images through deep learning methods. In particular, the combination of a U-Net convolutional neural network together with a convolutional long short-term memory (LSTM) layer is proposed. This model is applied for cloud detection on MSG/SEVIRI image time series over selected landmarks. Implementation details are provided and our proposal is compared against a standard SVM and a U-Net without the convolutional LSTM layer but including temporal information too. Experimental results show that this combination of networks exploits both the spatial and temporal dependence and provides st…

business.industryComputer scienceDeep learning0211 other engineering and technologiesCloud detectionPattern recognition02 engineering and technology010501 environmental sciences01 natural sciencesConvolutional neural networkImage (mathematics)Support vector machineLong short term memoryArtificial intelligenceLayer (object-oriented design)business021101 geological & geomatics engineering0105 earth and related environmental sciencesIGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium
researchProduct

Synergistic integration of optical and microwave satellite data for crop yield estimation

2019

Developing accurate models of crop stress, phenology and productivity is of paramount importance, given the increasing need of food. Earth observation (EO) remote sensing data provides a unique source of information to monitor crops in a temporally resolved and spatially explicit way. In this study, we propose the combination of multisensor (optical and microwave) remote sensing data for crop yield estimation and forecasting using two novel approaches. We first propose the lag between Enhanced Vegetation Index (EVI) derived from MODIS and Vegetation Optical Depth (VOD) derived from SMAP as a new joint metric combining the information from the two satellite sensors in a unique feature or des…

Signal Processing (eess.SP)FOS: Computer and information sciencesEarth observationCoefficient of determinationTeledetecció010504 meteorology & atmospheric sciencesEnhanced vegetation index0208 environmental biotechnologyFOS: Physical sciencesSoil Science02 engineering and technologyStatistics - Applications01 natural sciencesArticleModerate resolution imaging spectroradiometer (MODIS)Robustness (computer science)Machine learningLinear regressionFOS: Electrical engineering electronic engineering information engineeringFeature (machine learning)Kernel ridge regressionCrop yield estimationVegetation optical depthApplications (stat.AP)Electrical Engineering and Systems Science - Signal ProcessingComputers in Earth Sciences0105 earth and related environmental sciencesRemote sensingMathematics2. Zero hungerCrop yieldProcessos estocàsticsGeologyEnhanced vegetation indexAgro-ecosystems020801 environmental engineeringPhysics - Data Analysis Statistics and ProbabilityMetric (mathematics)Soil moisture active passive (SMAP)Data Analysis Statistics and Probability (physics.data-an)Imatges Processament
researchProduct

An Online Metric Learning Approach through Margin Maximization

2011

This work introduces a method based on learning similarity measures between pairs of objects in any representation space that allows to develop convenient recognition algorithms. The problem is formulated through margin maximization over distance values so that it can discriminate between similar (intra-class) and dissimilar (inter-class) elements without enforcing positive definiteness of the metric matrix as in most competing approaches. A passive-aggressive approach has been adopted to carry out the corresponding optimization procedure. The proposed approach has been empirically compared to state of the art metric learning on several publicly available databases showing its potential bot…

Similarity (geometry)business.industryComputationDimensionality reductionSemi-supervised learningMachine learningcomputer.software_genrek-nearest neighbors algorithmPositive definitenessMetric (mathematics)Artificial intelligenceRepresentation (mathematics)businesscomputerMathematics
researchProduct

Online Metric Learning Methods Using Soft Margins and Least Squares Formulations

2012

Online metric learning using margin maximization has been introduced as a way to learn appropriate dissimilarity measures in an efficient way when information as pairs of examples is given to the learning system in a progressive way. These schemes have several practical advantages with regard to global ones in which a training set needs to be processed. On the other hand, they may suffer from a poor performance depending on the quality of the examples and the particular tuning or other implementation details. This paper formulates several online metric learning alternatives using a passive-aggressive schema. A new formulation of the online problem using least squares is also introduced. The…

Mathematical optimizationTraining setbusiness.industrymedia_common.quotation_subjectMachine learningcomputer.software_genreLeast squaresSchema (genetic algorithms)Margin maximizationMetric (mathematics)Learning methodsQuality (business)Artificial intelligencebusinesscomputerMathematicsmedia_common
researchProduct

Flipped evaluation: herramientas online para la evaluación participativa

2021

[EN] The evaluation of a subject is a fundamental part of the teaching-learning process and one of the main concerns of our students. This is a complex task that requires a lot of effort from the teacher. This is a growing effort in line with the increased weight of con-tinuous evaluation in the current educational system. In this work, different methodo-logies focused on maximizing the student’s performance are presented, thus minimizing the extra effort for the teacher in the evaluation process. We provide several examples of activities throught Moodle platform such as the workshop, glossary, databases, ques-tionnaires, etc. Some of them allow self-assessment once configured, whereas others pr…

Innovación educativaMoodle toolsEvaluaciónQuestionnaireEducación superiorEnseñanza superiorHerramientas de MoodleTecnologías y educaciónTallerEvaluationWorkshopCuestionario
researchProduct

Efficient Nonlinear RX Anomaly Detectors

2020

Current anomaly detection algorithms are typically challenged by either accuracy or efficiency. More accurate nonlinear detectors are typically slow and not scalable. In this letter, we propose two families of techniques to improve the efficiency of the standard kernel Reed-Xiaoli (RX) method for anomaly detection by approximating the kernel function with either {\em data-independent} random Fourier features or {\em data-dependent} basis with the Nystr\"om approach. We compare all methods for both real multi- and hyperspectral images. We show that the proposed efficient methods have a lower computational cost and they perform similar (or outperform) the standard kernel RX algorithm thanks t…

FOS: Computer and information sciencesComputer Science - Machine LearningBasis (linear algebra)Computer scienceComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern Recognition0211 other engineering and technologiesApproximation algorithmHyperspectral imaging02 engineering and technologyElectrical Engineering and Systems Science - Image and Video ProcessingGeotechnical Engineering and Engineering GeologyRegularization (mathematics)Machine Learning (cs.LG)Nonlinear systemKernel (linear algebra)Kernel (statistics)FOS: Electrical engineering electronic engineering information engineeringAnomaly detectionElectrical and Electronic EngineeringAnomaly (physics)Algorithm021101 geological & geomatics engineeringIEEE Geoscience and Remote Sensing Letters
researchProduct

Estrategia de enseñanza y aprendizaje de programación basada en la idea de ’hackathon’

2021

[EN] The acquisition of programming and data analysis skills in higher education is increa-singly necessary in all areas of Science and Engineering. In this paper we present a methodology for the motivation of programming learning, mainly focused on the deve-lopment of machine learning algorithms. This methodology is based on the hackathon idea and will have different levels. On the one hand the basic level where a competition is proposed in an improvised way during the development of the class. A second level where a programmed hackathon is proposed but within the classroom environment and using learning management systems such as Moodle. The last level consists of parti-cipation in an exte…

Innovación educativaMotivationEvaluaciónParticipationProgramaciónParticipaciónEducación superiorProgrammingEnseñanza superiorDatathonMotivaciónTecnologías y educaciónHackathonEvaluation
researchProduct

Sensitivity Maps of the Hilbert-Schmidt Independence Criterion

2018

Abstract Kernel dependence measures yield accurate estimates of nonlinear relations between random variables, and they are also endorsed with solid theoretical properties and convergence rates. Besides, the empirical estimates are easy to compute in closed form just involving linear algebra operations. However, they are hampered by two important problems: the high computational cost involved, as two kernel matrices of the sample size have to be computed and stored, and the interpretability of the measure, which remains hidden behind the implicit feature map. We here address these two issues. We introduce the sensitivity maps (SMs) for the Hilbert–Schmidt independence criterion (HSIC). Sensi…

Mathematical optimization0211 other engineering and technologiesFeature selection02 engineering and technology010501 environmental sciences01 natural sciencesMeasure (mathematics)Kernel methodKernel (statistics)Linear algebraApplied mathematicsSensitivity (control systems)Random variableSoftwareIndependence (probability theory)021101 geological & geomatics engineering0105 earth and related environmental sciencesMathematicsApplied Soft Computing
researchProduct

Efficient Kernel Cook's Distance for Remote Sensing Anomalous Change Detection

2021

Detecting anomalous changes in remote sensing images is a challenging problem, where many approaches and techniques have been presented so far. We rely on the standard field of multivariate statistics of diagnostic measures, which are concerned about the characterization of distributions, detection of anomalies, extreme events, and changes. One useful tool to detect multivariate anomalies is the celebrated Cook's distance. Instead of assuming a linear relationship, we present a novel kernelized version of the Cook's distance to address anomalous change detection in remote sensing images. Due to the large computational burden involved in the direct kernelization, and the lack of out-…

Atmospheric ScienceMultivariate statisticsComputer scienceMultispectral image0211 other engineering and technologies02 engineering and technology010501 environmental sciences01 natural sciencesField (computer science)13. Climate actionKernel (statistics)KernelizationLeverage (statistics)Computers in Earth SciencesCook's distanceChange detection021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensing
researchProduct

Passive millimeter wave image classification with large scale Gaussian processes

2017

Passive Millimeter Wave Images (PMMWIs) are being increasingly used to identify and localize objects concealed under clothing. Taking into account the quality of these images and the unknown position, shape, and size of the hidden objects, large data sets are required to build successful classification/detection systems. Kernel methods, in particular Gaussian Processes (GPs), are sound, flexible, and popular techniques to address supervised learning problems. Unfortunately, their computational cost is known to be prohibitive for large scale applications. In this work, we present a novel approach to PMMWI classification based on the use of Gaussian Processes for large data sets. The proposed…

HyperparameterContextual image classificationbusiness.industryComputer scienceSupervised learning0211 other engineering and technologiesInferencePattern recognition02 engineering and technologysymbols.namesakeBayes' theoremKernel (linear algebra)Kernel methodKernel (statistics)0202 electrical engineering electronic engineering information engineeringsymbols020201 artificial intelligence & image processingArtificial intelligencebusinessGaussian process021101 geological & geomatics engineering2017 IEEE International Conference on Image Processing (ICIP)
researchProduct

Machine Learning Methods for Spatial and Temporal Parameter Estimation

2020

Monitoring vegetation with satellite remote sensing is of paramount relevance to understand the status and health of our planet. Accurate and constant monitoring of the biosphere has large societal, economical, and environmental implications, given the increasing demand of biofuels and food by the world population. The current democratization of machine learning, big data, and high processing capabilities allow us to take such endeavor in a decisive manner. This chapter proposes three novel machine learning approaches to exploit spatial, temporal, multi-sensor, and large-scale data characteristics. We show (1) the application of multi-output Gaussian processes for gap-filling time series of…

business.industryEstimation theoryComputer scienceBig dataBiosphereVegetationMachine learningcomputer.software_genreRandom forestsymbols.namesakeKernel (statistics)symbolsArtificial intelligenceScale (map)businessGaussian processcomputer
researchProduct

Warped Gaussian Processes in Remote Sensing Parameter Estimation and Causal Inference

2018

This letter introduces warped Gaussian process (WGP) regression in remote sensing applications. WGP models output observations as a parametric nonlinear transformation of a GP. The parameters of such a prior model are then learned via standard maximum likelihood. We show the good performance of the proposed model for the estimation of oceanic chlorophyll content from multispectral data, vegetation parameters (chlorophyll, leaf area index, and fractional vegetation cover) from hyperspectral data, and in the detection of the causal direction in a collection of 28 bivariate geoscience and remote sensing causal problems. The model consistently performs better than the standard GP and the more a…

FOS: Computer and information sciencesComputer Science - Machine LearningHeteroscedasticityRemote sensing applicationComputer scienceComputer Vision and Pattern Recognition (cs.CV)Maximum likelihoodComputer Science - Computer Vision and Pattern Recognition0211 other engineering and technologies02 engineering and technologyBivariate analysis010501 environmental sciences01 natural sciencesMachine Learning (cs.LG)Data modelingsymbols.namesakeElectrical and Electronic EngineeringGaussian process021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingParametric statisticsEstimation theoryHyperspectral imagingGeotechnical Engineering and Engineering GeologyConfidence intervalCausal inferencesymbolsIEEE Geoscience and Remote Sensing Letters
researchProduct

HyperLabelMe : A Web Platform for Benchmarking Remote-Sensing Image Classifiers

2017

HyperLabelMe is a web platform that allows the automatic benchmarking of remote-sensing image classifiers. To demonstrate this platform's attributes, we collected and harmonized a large data set of labeled multispectral and hyperspectral images with different numbers of classes, dimensionality, noise sources, and levels. The registered user can download training data pairs (spectra and land cover/use labels) and submit the predictions for unseen testing spectra. The system then evaluates the accuracy and robustness of the classifier, and it reports different scores as well as a ranked list of the best methods and users. The system is modular, scalable, and ever-growing in data sets and clas…

General Computer ScienceContextual image classificationComputer scienceMultispectral imageRegistered user020206 networking & telecommunications02 engineering and technologyBenchmarkingcomputer.software_genreData setStatistical classificationComputingMethodologies_PATTERNRECOGNITIONRobustness (computer science)ITC-ISI-JOURNAL-ARTICLE0202 electrical engineering electronic engineering information engineeringGeneral Earth and Planetary Sciences020201 artificial intelligence & image processingData miningElectrical and Electronic EngineeringInstrumentationcomputerClassifier (UML)IEEE Geoscience and Remote Sensing Magazine
researchProduct

Efficient remote sensing image classification with Gaussian processes and Fourier features

2017

This paper presents an efficient methodology for approximating kernel functions in Gaussian process classification (GPC). Two models are introduced. We first include the standard random Fourier features (RFF) approximation into GPC, which largely improves the computational efficiency and permits large scale remote sensing data classification. In addition, we develop a novel approach which avoids randomly sampling a number of Fourier frequencies, and alternatively learns the optimal ones using a variational Bayes approach. The performance of the proposed methods is illustrated in complex problems of cloud detection from multispectral imagery.

010504 meteorology & atmospheric sciencesContextual image classificationComputer scienceMultispectral imageData classification0211 other engineering and technologiesSampling (statistics)02 engineering and technology01 natural sciencessymbols.namesakeBayes' theoremFourier transformKernel (statistics)symbolsGaussian process021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensing
researchProduct