Abstract
The Energy Exascale Earth System Model (E3SM) Project is an ongoing, state-of-the-science Earth system modeling, simulation, and prediction project that targets efficient utilization of U.S. Department of Energy's (DOE) supercomputers to meet the science needs of the nation and the mission needs of DOE. This work focuses on our early evaluation of the A64FX architecture on Fugaku supercomputer using E3SM benchmarks. We will present results that track hardware trends, facilitate architecture comparison and the specific impact on our workload using an atmospheric model benchmark. We have two variants of the code written in Fortran and C++/Kokkos respectively which were used to collect data on a variety of CPU and GPU platforms. Furthermore, we have conducted a comparative evaluation of the compilers on the A64FX architecture and found GNU to be the best performer for our workload. Our experience so far indicates that Fugaku/A64FX shows promising energy efficiency (performance/Watt) with further performance gains possible through architecture-aware optimization efforts.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2021 IEEE International Conference on Cluster Computing, Cluster 2021 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 719-727 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781728196664 |
| DOIs | |
| State | Published - 2021 |
| Event | 2021 IEEE International Conference on Cluster Computing, Cluster 2021 - Virtual, Portland, United States Duration: Sep 7 2021 → Sep 10 2021 |
Publication series
| Name | Proceedings - IEEE International Conference on Cluster Computing, ICCC |
|---|---|
| Volume | 2021-September |
| ISSN (Print) | 1552-5244 |
Conference
| Conference | 2021 IEEE International Conference on Cluster Computing, Cluster 2021 |
|---|---|
| Country/Territory | United States |
| City | Virtual, Portland |
| Period | 09/7/21 → 09/10/21 |
Funding
This research used computational resources of supercomputer Fugaku provided by the RIKEN Center for Computational Science as part of the DOE (USA)- MEXT (Japan) collaboration. This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725. This research was supported by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. Department of Energy Office of Science and the National Nuclear Security Administration. This research was supported as part of the Energy Exascale Earth System Model (E3SM) project, funded by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy's National Nuclear Security Administration under contract DENA0003525. This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. Department of Energy or the United States Government. We would like to thank Oksana Guba of Sandia National Laboratory for assistance with the benchmark setup and related data for architecture comparison. Department of Energy Office of Science and the National Nuclear Security Administration. This research was supported as part of the Energy Exascale Earth System Model (E3SM) project, funded by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research. Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525. This paper describes objective technical results and analysis. Any subjective views or opinions that might be expressed in the paper do not necessarily represent the views of the U.S. Department of Energy or the United States Government. We would like to thank Oksana Guba of Sandia National Laboratory for assistance with the benchmark setup and related data for architecture comparison. One of the first obstacles encountered on Fugaku was the lack of CMake support for Fujitsu compilers and math libraries. We communicated the issue to Kitware, Arm and Fujitsu. Initial workaround and support was added by Kitware which has since been superseded by official support from Fujitsu. These developments have benefited all CMake based applications and libraries on Fugaku. This research used computational resources of supercomputer Fugaku provided by the RIKEN Center for Computational Science as part of the DOE (USA)-MEXT (Japan) collaboration. This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725. This research was supported by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the U.S. This manuscript has been authored in part by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/ downloads/doe-public-access-plan).
Keywords
- Architecture Evaluation
- Climate Model
- Workload Characterization