A multivariate approach to ensure statistical reproducibility of climate model simulations

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Effective utilization of novel hybrid architectures of pre-exascale and exascale machines requires transformations to global climate modeling systems that may not reproduce the original model solution bit-for-bit. Round-off level differences grow rapidly in these non-linear and chaotic systems. This makes it difficult to isolate bugs/errors from innocuous growth expected from round-off level differences. Here, we apply two modern multivariate two sample equality of distribution tests to evaluate statistical reproducibility of global climate model simulations using standard monthly output of short (∼ 1-year) simulation ensembles of a control model and a modified model of US Department of Energy’s Energy Exascale Earth System Model (E3SM). Both the tests are able to identify changes induced by modifications to some model tuning parameters. We also conduct formal power analyses of the tests by applying them on designed suites of short simulation ensembles each with an increasingly different climate from the control ensemble. The results are compared against those from another such test. These power analyses provide a framework to quantify the degree of differences that can be detected confidently by the tests for a given ensemble size (sample size). This will allow model developers using the tests to make an informed decision when accepting/rejecting an unintentional non-bit-for-bit change to the model solution.

Original languageEnglish
Title of host publicationProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2019
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450367707
DOIs
StatePublished - Jun 12 2019
Event6th Platform for Advanced Scientific Computing Conference, PASC 2019 - Zurich, Switzerland
Duration: Jun 12 2019Jun 14 2019

Publication series

NameProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2019

Conference

Conference6th Platform for Advanced Scientific Computing Conference, PASC 2019
Country/TerritorySwitzerland
CityZurich
Period06/12/1906/14/19

Funding

This research was supported as part of the Energy Exascale Earth System Model (E3SM) project and Climate Model Development and Validation program, funded by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research. This manuscript has been authored by UT-Battelle, LLC and used resources of the National Center for Computational Sciences at Oak Ridge National Laboratory, both of which are supported by the Office of Science of the U.S. Department of Energy under Contract No.DE-AC05-00OR22725. E3SM simulation data output is archived at U.S. Deparment of Energy’s leadership computing facilities and can be obtained by contacting the corresponding author. This work used the R package’energy’ and the matlab package’Kernel Two Sample Test’. We are grateful to Maria Rizzo and Gabriel Szekely who developed and maintain the’energy’ package and Arthur Gretton who developed and maintains the’Kernel Two Sample Test’ package.

FundersFunder number
U.S. Department of EnergyDE-AC05-00OR22725

    Keywords

    • Climate models
    • Multivariate statistics
    • Reproducibility

    Fingerprint

    Dive into the research topics of 'A multivariate approach to ensure statistical reproducibility of climate model simulations'. Together they form a unique fingerprint.

    Cite this