Abstract
Effective utilization of novel hybrid architectures of pre-exascale and exascale machines requires transformations to global climate modeling systems that may not reproduce the original model solution bit-for-bit. Round-off level differences grow rapidly in these non-linear and chaotic systems. This makes it difficult to isolate bugs/errors from innocuous growth expected from round-off level differences. Here, we apply two modern multivariate two sample equality of distribution tests to evaluate statistical reproducibility of global climate model simulations using standard monthly output of short (∼ 1-year) simulation ensembles of a control model and a modified model of US Department of Energy’s Energy Exascale Earth System Model (E3SM). Both the tests are able to identify changes induced by modifications to some model tuning parameters. We also conduct formal power analyses of the tests by applying them on designed suites of short simulation ensembles each with an increasingly different climate from the control ensemble. The results are compared against those from another such test. These power analyses provide a framework to quantify the degree of differences that can be detected confidently by the tests for a given ensemble size (sample size). This will allow model developers using the tests to make an informed decision when accepting/rejecting an unintentional non-bit-for-bit change to the model solution.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2019 |
| Publisher | Association for Computing Machinery, Inc |
| ISBN (Electronic) | 9781450367707 |
| DOIs | |
| State | Published - Jun 12 2019 |
| Event | 6th Platform for Advanced Scientific Computing Conference, PASC 2019 - Zurich, Switzerland Duration: Jun 12 2019 → Jun 14 2019 |
Publication series
| Name | Proceedings of the Platform for Advanced Scientific Computing Conference, PASC 2019 |
|---|
Conference
| Conference | 6th Platform for Advanced Scientific Computing Conference, PASC 2019 |
|---|---|
| Country/Territory | Switzerland |
| City | Zurich |
| Period | 06/12/19 → 06/14/19 |
Funding
This research was supported as part of the Energy Exascale Earth System Model (E3SM) project and Climate Model Development and Validation program, funded by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research. This manuscript has been authored by UT-Battelle, LLC and used resources of the National Center for Computational Sciences at Oak Ridge National Laboratory, both of which are supported by the Office of Science of the U.S. Department of Energy under Contract No.DE-AC05-00OR22725. E3SM simulation data output is archived at U.S. Deparment of Energy’s leadership computing facilities and can be obtained by contacting the corresponding author. This work used the R package’energy’ and the matlab package’Kernel Two Sample Test’. We are grateful to Maria Rizzo and Gabriel Szekely who developed and maintain the’energy’ package and Arthur Gretton who developed and maintains the’Kernel Two Sample Test’ package.
Keywords
- Climate models
- Multivariate statistics
- Reproducibility