Transfer Learning with Convolutional Networks for Atmospheric Parameter Retrieval

RESEARCH PRODUCT

Allan Aasbjerg Nielsen, Gustau Camps-Valls, Valero Laparra, David Malmgren-Hansen

subject

FOS: Computer and information sciences; Computer Science - Machine Learning; Computer science; Feature extraction; 0211 other engineering and technologies; Transfer learning; FOS: Physical sciences; 02 engineering and technology; Atmospheric model; Infrared atmospheric sounding interferometer; computer.software_genre; Convolutional neural network; Machine Learning (cs.LG); 0202 electrical engineering, electronic engineering, information engineering; Infrared measurements; 021101 geological & geomatics engineering; Artificial neural network; Statistical model; Numerical weather prediction; Parameter retrieval; Physics - Atmospheric and Oceanic Physics; Kernel method; 13. Climate action; Atmospheric and Oceanic Physics (physics.ao-ph); Convolutional neural networks; 020201 artificial intelligence & image processing; Data mining; computer; Curse of dimensionality

description

The Infrared Atmospheric Sounding Interferometer (IASI) on board the MetOp satellite series provides important measurements for Numerical Weather Prediction (NWP). Retrieving accurate atmospheric parameters from the raw IASI data is a major challenge, but it is necessary in order to use the data in NWP models. The performance of statistical models is compromised by the extremely high spectral dimensionality and the large number of variables to be predicted simultaneously across the atmospheric column. This makes it difficult to select and study optimal models and processing schemes. Earlier work has shown that non-linear models such as kernel methods and neural networks perform well on this task, but both are computationally heavy on large quantities of data: kernel methods do not scale well with the number of training samples, and neural networks require setting critical hyperparameters. In this work we follow an alternative pathway: we study transfer learning in convolutional neural networks (CNNs) to alleviate the retraining cost by starting from proxy solutions (either features or networks) obtained from models previously trained on related variables. We show that features extracted from the IASI data by a CNN trained to predict one physical variable can be used as inputs to another statistical method designed to predict a different physical variable at low altitude. In addition, the learned parameters can be transferred to another CNN model, which then requires only fine-tuning to obtain results equivalent to those of a CNN trained from scratch.
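
The feature-transfer idea in the description can be illustrated with a minimal sketch. It is not the authors' architecture or data. The spectra and the target variable are synthetic, `PRETRAINED_KERNELS` is a hypothetical stand-in for convolution filters that a CNN would have learned on a source variable, and the "other statistical method" is assumed here to be a plain linear regressor fitted by gradient descent. All function names are illustrative.

```python
import random

random.seed(0)  # reproducible synthetic data

# ASSUMPTION: hypothetical fixed kernels standing in for filters that a CNN
# learned while being trained on a *source* variable. They are never updated.
PRETRAINED_KERNELS = [[0.5, 0.5], [1.0, -1.0], [-0.5, 1.0, -0.5]]


def extract_features(spectrum, kernels):
    """Frozen extractor: 1-D convolution, ReLU, then global average pooling."""
    features = []
    for kernel in kernels:
        width = len(kernel)
        activations = [
            max(0.0, sum(w * spectrum[i + j] for j, w in enumerate(kernel)))
            for i in range(len(spectrum) - width + 1)
        ]
        features.append(sum(activations) / len(activations))
    return features


def predict(x, weights, bias):
    """Linear prediction for one feature vector."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias


def mse(features, targets, weights, bias):
    """Mean squared error of the linear model over a data set."""
    errors = [predict(x, weights, bias) - y for x, y in zip(features, targets)]
    return sum(e * e for e in errors) / len(errors)


def fit_linear_head(features, targets, lr=0.05, epochs=300):
    """Train only the new regressor (weights and bias) with gradient descent."""
    n, d = len(features), len(features[0])
    weights, bias = [0.0] * d, 0.0
    for _ in range(epochs):
        errors = [predict(x, weights, bias) - y for x, y in zip(features, targets)]
        grad_w = [2.0 * sum(e * x[j] for e, x in zip(errors, features)) / n
                  for j in range(d)]
        grad_b = 2.0 * sum(errors) / n
        weights = [w - lr * g for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b
    return weights, bias


# ASSUMPTION: synthetic "spectra" (40 samples, 30 channels) and a synthetic
# *target* variable that differs from whatever the kernels were trained on.
spectra = [[random.random() for _ in range(30)] for _ in range(40)]
targets = [3.0 * sum(s[:10]) / 10 + 1.0 for s in spectra]

# Step 1: reuse the frozen extractor. Step 2: fit a new model on its features.
features = [extract_features(s, PRETRAINED_KERNELS) for s in spectra]
baseline_mse = mse(features, targets, [0.0] * len(PRETRAINED_KERNELS), 0.0)
weights, bias = fit_linear_head(features, targets)
trained_mse = mse(features, targets, weights, bias)
print(f"untrained head MSE: {baseline_mse:.4f}, trained head MSE: {trained_mse:.4f}")
```

Only the regressor on top is trained here, so the cost of refitting for a new variable is small. The second pathway in the description, fine-tuning, would instead initialize a new CNN with the transferred parameters and continue updating the kernels as well. That step is not shown in this sketch.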

year	journal	country	edition	language
2018-01-01	IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium

10.1109/igarss.2018.8518097 http://dx.doi.org/10.1109/igarss.2018.8518097