Skip to main navigation Skip to search Skip to main content

Do we have globally representative data to understand soil processes?

  • Avni Malhotra
  • , Sophie F. von Fromm
  • , Ben Bond-Lamberty
  • , Sebastian Doetterl
  • , Katerina Georgiou
  • , Emily B. Graham
  • , Katherine A. Heckman
  • , Ruofei Jia
  • , Kaizad F. Patel
  • , Kenton A. Rod
  • , Fernanda Santos
  • , César Terrer
  • , Katherine Todd-Brown
  • , Jianqiu Zheng
  • , Kirsten Hofmockel
  • , Vanessa Bailey

Research output: Contribution to journalArticlepeer-review

Abstract

Understanding and modeling soils and soil organic matter (SOM) are central to a variety of human needs, from food production to ecosystem management. Soil data have been collected for over a century, but the global spatial and process representativeness of soil data remains unclear. We assessed the representativeness of currently available soil data that could be used to understand a variety of SOM processes. We used 16 open-source soil databases and data from over 281,000 unique locations globally, categorizing the databases into three main data types necessary to understand SOM processes: soil carbon stocks and fluxes, mechanistic drivers of these stocks and fluxes, and soil carbon gain or loss potential. We found that stock and driver data have extensive global coverage. However, data on soil carbon gain or loss potential, particularly data describing change in soils over time such as time series data, are severely limited in their global coverage. We conclude that while significant strides have been made in measuring soil carbon stocks and fluxes, and their drivers, we are limited in global data related to changes in soils over time. Our recommendations for soil data generators are to ensure precise metadata reporting and prioritizing sampling in underrepresented areas like tropical, arctic, mountainous, wetland and arid regions. We also encourage designing revisit schemes that explicitly support change detection and reporting multi-modal datasets that can aid in model development. Targeted measurement of low coverage soil data types and regions is necessary for a range of applications including current and future biogeochemical predictions, and their management and policy implications.

Original languageEnglish
Article number23
JournalBiogeochemistry
Volume169
Issue number2
DOIs
StatePublished - Apr 2026

Keywords

  • Carbon fluxes
  • Carbon stocks
  • Representativeness analysis
  • Soil carbon
  • Soil databases
  • Time series data

Fingerprint

Dive into the research topics of 'Do we have globally representative data to understand soil processes?'. Together they form a unique fingerprint.

Cite this