Abstract
This paper presents a quantitative evaluation of the power usage over time in data-intensive applications that use MapReduce over MPI. We leverage the PAPI powercap tool to identify ideal conditions for execution of our mini-applications in terms of (1) dataset characteristics (e.g., unique words in datasets); (2) system characteristics (e.g., KNL and KNM); and (3) implementation of the MapReduce programming model (e.g., impact of various optimizations). Results illustrate the high power utilization and runtime costs of data management on HPC architectures.
Original language | English |
---|---|
Title of host publication | Parallel Computing |
Subtitle of host publication | Technology Trends |
Editors | Ian Foster, Gerhard R. Joubert, Ludek Kucera, Wolfgang E. Nagel, Frans Peters |
Publisher | IOS Press BV |
Pages | 287-298 |
Number of pages | 12 |
ISBN (Electronic) | 9781643680705 |
DOIs | |
State | Published - 2020 |
Publication series
Name | Advances in Parallel Computing |
---|---|
Volume | 36 |
ISSN (Print) | 0927-5452 |
ISSN (Electronic) | 1879-808X |
Funding
This work was supported by NSF CCF 1841758. This work was supported by NSF CCF 1841758
Keywords
- Combiner optimizations
- Data management
- KNL
- KNM
- PAPI