6533b7dafe1ef96bd126d96b

RESEARCH PRODUCT

Earth system data cubes unravel global multivariate dynamics

M. D. MahechaM. D. MahechaM. D. MahechaF. GansG. BrandtR. ChristiansenS. E. CornellN. FomferraG. KraemerG. KraemerG. KraemerJ. PetersP. BodesheimP. BodesheimG. Camps-vallsJ. F. DongesJ. F. DongesW. DorigoL. M. Estupinan-suarezL. M. Estupinan-suarezV. H. Gutierrez-velezM. GutwinM. GutwinM. JungM. C. LondoñoD. G. MirallesP. PapastefanouM. ReichsteinM. ReichsteinM. Reichstein

subject

Agriculture and Food SciencesDECOMPOSITION0106 biological sciencesFLUXESDependency (UML)lcsh:Dynamic and structural geology010504 meteorology & atmospheric sciencesInterface (Java)Computer scienceDIMENSIONALITY010603 evolutionary biology01 natural sciencesESAData cube03 medical and health scienceslcsh:QE500-639.5TEMPERATURE SENSITIVITYlcsh:Science030304 developmental biology0105 earth and related environmental sciences0303 health sciencesData stream mininglcsh:QE1-996.5SCIENCEFRAMEWORKData sciencePRODUCTSlcsh:GeologyMODELEarth system scienceVariable (computer science)Workflow13. Climate actionGeneral Earth and Planetary Scienceslcsh:QSOIL RESPIRATIONCurse of dimensionality

description

Understanding Earth system dynamics in light of ongoing human intervention and dependency remains a major scientific challenge. The unprecedented availability of data streams describing different facets of the Earth now offers fundamentally new avenues to address this quest. However, several practical hurdles, especially the lack of data interoperability, limit the joint potential of these data streams. Today, many initiatives within and beyond the Earth system sciences are exploring new approaches to overcome these hurdles and meet the growing interdisciplinary need for data-intensive research; using data cubes is one promising avenue. Here, we introduce the concept of Earth system data cubes and how to operate on them in a formal way. The idea is that treating multiple data dimensions, such as spatial, temporal, variable, frequency, and other grids alike, allows effective application of user-defined functions to co-interpret Earth observations and/or model-data integration. An implementation of this concept combines analysis-ready data cubes with a suitable analytic interface. In three case studies, we demonstrate how the concept and its implementation facilitate the execution of complex workflows for research across multiple variables, and spatial and temporal scales: (1) summary statistics for ecosystem and climate dynamics; (2) intrinsic dimensionality analysis on multiple timescales; and (3) model-data integration. We discuss the emerging perspectives for investigating global interacting and coupled phenomena in observed or simulated data. In particular, we see many emerging perspectives of this approach<span idCombining double low line"page202"/> for interpreting large-scale model ensembles. The latest developments in machine learning, causal inference, and model-data integration can be seamlessly implemented in the proposed framework, supporting rapid progress in data-intensive research across disciplinary boundaries.

10.5194/esd-11-201-2020https://hdl.handle.net/21.11116/0000-0005-BF8D-621.11116/0000-0004-D3BC-A