6533b7dafe1ef96bd126d8ab

RESEARCH PRODUCT

Randomized kernels for large scale Earth observation applications

Gustau Camps-vallsJordi Muñoz-maríAdrian Perez-suayJulia Amorós-lópezLuis Gómez-chovaValero Laparra

subject

FOS: Computer and information sciencesEarth observationComputer Science - Machine Learning010504 meteorology & atmospheric sciencesComputer scienceRemote sensing application0211 other engineering and technologiesSoil Science02 engineering and technologycomputer.software_genre01 natural sciencesMachine Learning (cs.LG)Computers in Earth Sciences021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingContextual image classificationEstimation theoryHyperspectral imagingGeology15. Life on landKernel methodKernel regressionData miningComputational problemcomputer

description

Abstract Current remote sensing applications of bio-geophysical parameter estimation and image classification have to deal with an unprecedented big amount of heterogeneous and complex data sources. New satellite sensors involving a high number of improved time, space and wavelength resolutions give rise to challenging computational problems. Standard physical inversion techniques cannot cope efficiently with this new scenario. Dealing with land cover classification of the new image sources has also turned to be a complex problem requiring large amount of memory and processing time. In order to cope with these problems, statistical learning has greatly helped in the last years to develop statistical retrieval and classification models that can ingest large amounts of Earth observation data. Kernel methods constitute a family of powerful machine learning algorithms, which have found wide use in remote sensing and geosciences. However, kernel methods are still not widely adopted because of the high computational cost when dealing with large scale problems, such as the inversion of radiative transfer models or the classification of high spatial-spectral-temporal resolution data. This paper introduces to the remote sensing community an efficient kernel method for fast statistical retrieval of atmospheric and biophysical parameters and image classification problems. We rely on a recently presented approximation to shift-invariant kernels using projections on random Fourier features. The method proposes an explicit mapping function defined through a set of projections randomly sampled from the Fourier domain. It is proved to approximate the implicit mapping of a kernel function. This allows to deal with large-scale data but taking advantage of kernel methods. The method is simple, computationally very efficient in both memory and processing costs, and easily parallelizable. We show that kernel regression and classification is now possible for datasets with millions of samples. Examples on atmospheric parameter retrieval from hyperspectral infrared sounders like IASI/Metop; large scale emulation and inversion of the familiar PROSAIL radiative transfer model on Sentinel-2 data; and the identification of clouds over landmarks in time series of MSG/Seviri images show the efficiency and effectiveness of the proposed technique.

10.1016/j.rse.2017.02.009http://dx.doi.org/10.1016/j.rse.2017.02.009