Developing Ultrahigh-Resolution E3SM Land Model for GPU Systems

Peter Schwartz, Dali Wang, Fengming Yuan, Peter Thornton

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Designing and refactoring complex scientific code, such as the E3SM land model (ELM), for new computing architectures is challenging. This paper presents design strategies and technical approaches to develop a data-oriented, GPU-ready ELM model using compiler directives (OpenACC/OpenMP). We first analyze the datatypes and processes in the original ELM code. Then we present design considerations for ultrahigh-resolution ELM (uELM) development for massive GPU systems. These techniques include the global data-oriented simulation workflow, domain partition, code porting and data copy, memory reduction, parallel loop restructure and flattening, and race condition detection. We implemented the first version of uELM using OpenACC targeting the NVidia GPUs in the Summit supercomputer at Oak Ridge National Laboratory. During the implementation, we developed a software tool (named SPEL) to facilitate code generation, verification, and performance tuning using these techniques. The first uELM implementation for Nvidia GPUs on Summit delivered promising results: 1) over 98% of the ELM code was automatically generated and tuned by scripts. Most ELM modules had better computational performances than the original ELM code for CPUs. The GPU-ready uELM is more scalable than the CPU code on fully-loaded Summit nodes. Example profiling results from several modules are also presented to illustrate the performance improvements and race condition detection. The lessons learned and toolkit developed in the study are also suitable for further uELM deployment using OpenMP on the first US exascale computer, Frontier, equipped with AMD CPUs and GPUs.

Original languageEnglish
Title of host publicationComputational Science and Its Applications – ICCSA 2023 - 23rd International Conference, Proceedings
EditorsOsvaldo Gervasi, Beniamino Murgante, David Taniar, Bernady O. Apduhan, Ana Cristina Braga, Chiara Garau, Anastasia Stratigea
PublisherSpringer Science and Business Media Deutschland GmbH
Pages277-290
Number of pages14
ISBN (Print)9783031368042
DOIs
StatePublished - 2023
Event23rd International Conference on Computational Science and Its Applications , ICCSA 2023 - Athens, Greece
Duration: Jul 3 2023Jul 6 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13956 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference23rd International Conference on Computational Science and Its Applications , ICCSA 2023
Country/TerritoryGreece
CityAthens
Period07/3/2307/6/23

Funding

Acknowledgments. This work has been supported by FCT - Funda¸cão para a Ciência e Tecnologia within the R&D Units Project Scope: UIDB/00319/2020 and the project “QOMPASS .: Solu¸cão de Gestão de Servi¸cos de Atendimento multi-entidade, multi-servi¸co e multi-idioma” within the Project Scope NORTE-01-0247-FEDER-038462. Acknowledgments. This study was grant No. 21-71-00039. Acknowledgement. This work was supported in part by Sunway University and Sunway Business School under Kick Start Grant Scheme (KSGS) NO: GRTIN-KSGS-DBA[S]-02-2022. This work is also part of the Sustainable Business Research Cluster and Research Centre for Human-Machine Collaboration (HUMAC) at Sunway University. We also wish to thank those who have supported this research. Supported by organization Conselho Nacional de Desenvolvimento Científico e Tec-nológico (CNPq). Acknowledgements. This study was financed in part by the Coordena¸cão de Aper-fei¸coamento de Pessoal de Ńıvel Superior - Brasil (CAPES). ICCSA 2023 was organized by the National Technical University of Athens (Greece), the University of the Aegean (Greece), the University of Perugia (Italy), the University of Basilicata (Italy), Monash University (Australia), Kyushu Sangyo University (Japan), the University of Minho (Portugal). The conference was supported by two NTUA Schools, namely the School of Rural, Surveying and Geoinformatics Engineering and the School of Electrical and Computer Engineering. This study was nanced in part by the Coordenac ao de Aperfeicoamento de Pessoal de Nvel Superior - Brasil (CAPES) - Finance Code 001. This work was partially funded by CNPq, CAPES, FINEP and Fapemig. Acknowledgement. This research was supported by the MUNI Award in Science and Humanities (MASH Belarus) of the Grant Agency of Masaryk University under the Digital City project (MUNI/J/0008/2021). The work of Stanislav Sobolevsky was also partially supported by ERDF “CyberSecurity, CyberCrime and Critical Information Infrastructures Center of Excellence” (No. CZ.02.1.01/0.0/0.0/16 019/0000822). Acknowledgements. The author sincerely acknowledges the financial support under Supported by organization LNCC/SEPAD. University of Perugia, Italy Universidade Nova de Lisboa, Portugal University of Beira Interior, Portugal University of Almeria, Spain University of Salerno, Italy Erciyes University, Turkey University of Naples “Federico II”, Italy Sungkyunkwan University, Korea Sunway University, Malaysia Sungkyunkwan University, Korea Polytechnic Institute of Viana do Castelo, Portugal Federal University of Bahia, Brazil INFN, Italy Universidade Federal do Rio Grande do Sul, Brazil The Council for Scientific and Industrial Research (CSIR), South Africa Instituto Tecnológico de Informática, Spain Kausan University of Technology, Lithuania London South Bank University, UK Memorial University of Newfoundland, Canada University of Coimbra, Portugal University of L’Aquila, Italy NetApp, India/USA University of Perugia, Italy University of Minho, Portugal U.S. DOE Ames Laboratory, USA Polytechnic Institute of Bragança, Portugal National Centre for Biotechnology, CSIS, Spain Polytechnic Institute of Bragança, Portugal University of Aveiro, Portugal by the Russian Science Foundation Acknowledgements. The authors would like to thank Conselho Nacional de Desen-volvimento Científico e Tecnológico (CNPq), Coordenação de Aperfeiçoamento de Pes-soal de Nível Superior (Capes) and Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio) for their financial support. Supported by the Government of Spain, Department of Science, Innovation and Universities; European Commision project: PID2021-125871NB-I00.

FundersFunder number
Canada University of Coimbra
DOE Ames Laboratory
Department of Science, Innovation and Universities
Grant Agency of Masaryk UniversityMUNI/J/0008/2021
India/USA University of Perugia
Instituto Tecnológico de Informática, Spain Kausan University of Technology, Lithuania London South Bank University, UK Memorial University of Newfoundland
Italy Erciyes University
Italy Sungkyunkwan University
Italy University of Minho
Korea Polytechnic Institute of Viana do Castelo
Korea Sunway University, Malaysia Sungkyunkwan University
LNCC
Portugal Federal University of Bahia
Portugal National Centre for Biotechnology
Portugal University of Almeria, Spain University of Salerno
Portugal University of Aveiro
Portugal University of Beira Interior
Portugal University of L’Aquila
SEPAD
School of Rural, Surveying and Geoinformatics Engineering
Sunway Business School-02-2022
Turkey University of Naples
Center for Strategic and International Studies
European CommissionPID2021-125871NB-I00
Council for Scientific and Industrial Research, South Africa
Fundação para a Ciência e a TecnologiaUIDB/00319/2020
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior
Conselho Nacional de Desenvolvimento Científico e Tecnológico
Financiadora de Estudos e Projetos
Fundação de Amparo à Pesquisa do Estado de Minas Gerais
Universidade Nova de Lisboa
Russian Science Foundation17-71-30029
European Regional Development FundCZ.02.1.01/0.0/0.0/16 019/0000822
Università degli Studi di Perugia
Sunway University
School of Electrical and Computer Engineering,University of Tehran
Instituto Politécnico de Bragança
Universidade Federal de Mato Grosso do Sul
National Technical University of Athens
Pontifícia Universidade Católica do Rio de Janeiro

    Keywords

    • Compiler Directives
    • E3SM Land Model
    • Exascale Energy Earth System Model
    • OpenACC
    • Ultrahigh-Resolution ELM

    Fingerprint

    Dive into the research topics of 'Developing Ultrahigh-Resolution E3SM Land Model for GPU Systems'. Together they form a unique fingerprint.

    Cite this