Abstract
Representing soil organic carbon (SOC) dynamics in Earth system models (ESMs) is a key source of uncertainty in predicting carbon-climate feedbacks. Machine learning models can help identify dominant environmental controllers and establish their functional relationships with SOC stocks. The resulting knowledge can be integrated into ESMs to reduce uncertainty and improve predictions of SOC dynamics over space and time. In this study, we used a large number of SOC field observations (n = 54 000), geospatial datasets of environmental factors (n = 46), and two machine learning approaches (namely random forest, RF, and generalized additive modeling, GAM) to (1) identify dominant environmental controllers of global and biome-specific SOC stocks, (2) derive functional relationships between environmental controllers and SOC stocks, and (3) compare the identified environmental controllers and predictive relationships with those in models used in Phase 6 of the Coupled Model Intercomparison Project (CMIP6). Our results showed that the diurnal temperature, drought index, cation exchange capacity, and precipitation were important observed environmental predictors of global SOC stocks. While the RF model identified 14 environmental factors that describe climatic, vegetation, and edaphic conditions as important predictors of global SOC stocks (R2 = 0.61, RMSE = 0.46 kg m-2), current ESMs oversimplify the relationships between environmental factors and SOC, with precipitation, temperature, and net primary productivity explaining > 96 % of the variability in ESM-modeled SOC stocks. Further, our study revealed notable disparities among the functional relationships between environmental factors and SOC stocks simulated by ESMs compared with observed relationships. To improve SOC representations in ESMs, it is imperative to incorporate additional environmental controls, such as the cation exchange capacity, and refine the functional relationships to align more closely with observations.
Original language | English |
---|---|
Pages (from-to) | 5173-5183 |
Number of pages | 11 |
Journal | Biogeosciences |
Volume | 21 |
Issue number | 22 |
DOIs | |
State | Published - Nov 19 2024 |
Funding
This study was jointly supported by the Laboratory Directed Research and Development program of Sandia National Laboratories and the Reducing Uncertainties in Biogeochemical Interactions through Synthesis and Computation Science Focus Area (RUBISCO SFA), which is sponsored by the Regional and Global Model Analysis (RGMA) activity of the Earth Environmental Systems Modeling (EESM) program of the Earth and Environmental Systems Sciences Division (EESSD), Office of Biological and Environmental Research (BER), within the US Department of Energy's Office of Science. Sandia National Laboratories is a multi-mission laboratory managed and operated by National Technology and Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International, Inc., for the US Department of Energy's National Nuclear Security Administration under contract no. DE-NA-0003525. Lawrence Berkeley National Laboratory (LBNL) is managed by the Regents of the University of California for the US Department of Energy under contract no. DE-AC02-05CH11231. Oak Ridge National Laboratory (ORNL) is managed by UT-Battelle, LLC, for the US Department of Energy under contract no. DE-AC05-00OR22725. This research has been supported by the Office of Science (grant no. DE-NA-0003525).
Funders | Funder number |
---|---|
Sandia National Laboratories | |
Biological and Environmental Research | |
Laboratory Directed Research and Development program of Sandia National Laboratories | |
U.S. Department of Energy | |
Earth Environmental Systems Modeling (EESM) program | |
Office of Science | |
National Nuclear Security Administration | DE-NA-0003525 |
National Nuclear Security Administration | |
University of California | DE-AC02-05CH11231 |
University of California | |
Oak Ridge National Laboratory | DE-AC05-00OR22725 |
Oak Ridge National Laboratory |