Ensuring statistical reproducibility of ocean model simulations in the age of hybrid computing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Novel high performance computing systems that feature hybrid architectures require large scale code refactoring to unravel underlying exploitable parallelism. Such redesign can often be accompanied with machine-precision changes as the order of computation cannot always be maintained. For chaotic systems like climate models, these round-off level differences can grow rapidly. Systematic errors may also manifest initially as machine-precision differences. Isolating genuine round off level differences from such errors remains a challenge. Here, we apply two-sample equality of distribution tests to evaluate statistical reproducibility of the ocean model component of US Department of Energy's Energy Exascale Earth System Model (E3SM). A 2-year control simulation ensemble is compared to a modified ensemble as a test case - after a known non-bit-for-bit change in a model component is introduced - to evaluate the null hypothesis that the two ensembles are statistically indistinguishable. To quantify the false negative rates of these tests, we conduct a formal power analysis using a targeted suite of short simulation ensembles. The ensemble suite contains several perturbed ensembles, each with a progressively different climate than the baseline ensemble - obtained by perturbing the magnitude of a single model tuning parameter, the Gent and McWilliams κ, in a controlled manner. The null hypothesis is evaluated for each of perturbed ensembles using these tests. The power analysis informs on the detection limits of the tests for given ensemble size allowing model developers to evaluate the impact of an introduced non-bit-for-bit change to the model.

Original languageEnglish
Title of host publicationProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2021
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450385633
DOIs
StatePublished - Jul 5 2021
Event2021 Platform for Advanced Scientific Computing Conference, PASC 2021 - Virtual, Online, Switzerland
Duration: Jul 5 2021Jul 9 2021

Publication series

NameProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2021

Conference

Conference2021 Platform for Advanced Scientific Computing Conference, PASC 2021
Country/TerritorySwitzerland
CityVirtual, Online
Period07/5/2107/9/21

Funding

The E3SM code that produced the simulations is open-source and can be accessed from https://e3sm.org/model. All the data used are listed in the references. This research was supported as part of the Energy Exascale Earth System Model (E3SM) project, funded by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research. This manuscript has been authored by UT-Battelle, LLC which is supported by the Office of Science of the U.S. Department of Energy under Contract No.DE-AC05-00OR22725. This research used the resources of the Oak Ridge and Argonne Leadership Computing Facilities at the Oak Ridge and Argonne National Laboratories, respectively, and the National Energy Research Scientific Computing Center, which are supported by the Office of Science of the U.S. Department of Energy under Contracts DE-AC05-00OR22725, DE-AC02-06CH11357, and DE-AC02-05CH11231.

Keywords

  • Climate statistics
  • Hybrid computing
  • Machine learning
  • Ocean modeling
  • Solution reproducibility

Fingerprint

Dive into the research topics of 'Ensuring statistical reproducibility of ocean model simulations in the age of hybrid computing'. Together they form a unique fingerprint.

Cite this