Abstract
Emerging HPC platforms are becoming more difficult to program as a result of systems with different node architectures, some with a small number of "fat"heterogenous nodes (consisting of multiple accelerators) and others with a large number of "thin"homogenous nodes consisting of multi-core CPUs connected with high speed interconnects. New programming models are emerging to address performance portability of the applications as well as a set of scientific libraries that applications can use to exploit these architectures efficiently. To port applications to new architectures, developers need information about their source code characteristics including static and dynamic (e.g. performance) information to refactor the code, understand their data and code structure, and library usage as well as program information to direct their optimisation efforts and make key decisions. In this paper, we describe a tool that combines compiler and profiler information to query program characteristics in a given programming environment. Static and dynamic data about applications is collected and stored together in an SQL database that can be later queried to study application characteristics and patterns. We will demonstrate the capabilities of this tool with an application-driven case study that aims at understanding application code and its use of scientific libraries via a real world example from the molecular simulation application CP2K.
Original language | English |
---|---|
Title of host publication | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 420-429 |
Number of pages | 10 |
ISBN (Electronic) | 9781665435772 |
DOIs | |
State | Published - Jun 2021 |
Event | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - Virtual, Portland, United States Duration: May 17 2021 → … |
Publication series
Name | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 - In conjunction with IEEE IPDPS 2021 |
---|
Conference
Conference | 2021 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2021 |
---|---|
Country/Territory | United States |
City | Virtual, Portland |
Period | 05/17/21 → … |
Funding
We would like to thank Terry Jones and Ada Sedova for their guidance and feedback on this work. Some of this work was supported by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the United States Department of Energy Office of Science and the National Nuclear Security Administration, and used resources of the Computational Research and Development Programs and the OLCF at Oak Ridge National Laboratory. This manuscript has been authored by UT-Battelle, LLC under contract no. DE-AC05-00OR22725 with the United States Department of Energy. The United States government retains, and the publisher, by accepting the article for publication, acknowledges that the United States government retains a non-exclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DoE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Keywords
- databases
- dynamic analysis
- static analysis