Open Earth Monitor — Global Workshop 2024

Tom Hengl (OpenGeoHub)

Tom is the Director at the OpenGeoHub foundation. He has more than 20 years of experience as an environmental modeler, data scientist and spatial analyst with background in soil mapping and geo-information science. He continuously runs hands-on-R training courses to promote use of Open Source software for spatial analysis / spatial modeling purposes. He is currently the project leader of the OpenLandMap — a system for automated global soil and vegetation mapping at fine spatial resolutions (100 m, 250 m to 1 km) and which aspires to be recognized as an “OpenStreetMap-type” system for environmental data. Tom’s core core philosophy is outlined in this document (see also our Medium article on OpenLandMap.org). Tom is recipient of the Clarivate Highly Cited Researchers for 2021/2022/2023.

The speaker's profile picture

Do you accept that a video-recording of your talk is published under CC-BY license via https://av.tib.eu? – yes

Sessions

10-03
15:30
45min
Workshop: Spatiotemporal Machine Learning: fitting models and generating predictions using time-series data
Tom Hengl (OpenGeoHub)

Machine Learning is commonly used to map environmental variables in 2D, but what about generating predictions of dynamic variables such as above ground biomass, forest species, soil carbon and similar? The difference between spatiotemporal vs purely 2D / 3D mapping is in the three main aspects: (1) points and covariate layers are matched in spacetime (usually month-year period or at least year), (2) covariate layers are based on time-series data and include also accumulative indices (e.g. cumulative rainfall, cumulative snow cover, cumulative cropping fraction and similar) and derivatives, (3) during model training and validation, points are subset in both spacetime to avoid overfitting and bias in predictions. The rationale for using spatiotemporal machine learning is fitness of data for reliable time-series analysis: the predictions for anywhere in the spacetime cube need to be unbiased, with objectively quantified prediction errors (uncertainty), so that hence changes can be derived without a risk for serious over-/under-estimation. We have tested this framework on local and regional data sets (e.g. LUCAS soil samples covering 2009, 2012, 2015, 2018 for Europe) and can be now potentially applied using global compilations of soil points (https://opengeohub.github.io/SoilSamples/). Spatiotemporal machine learning could also potentially be used for predicting future states of soil, e.g. by extrapolating models to future climate scenarios and future land use systems (Bonannella et al., 2023).

OEMC project workshop
Wodak Room (IIASA)