Abstract
To better understand the impact of hardware and software data prefetching on scientific application performance, this paper introduces two analysis techniques, one micro-architecture-centric and the other application-centric. We use these techniques to analyze representative full-scale production applications from five important Exascale target areas. We find that despite great diversity in prefetching effectiveness across and even within applications, there is a strong correlation between the regions where prefetching is most needed, due to high levels of memory traffic, and the regions where it is most effective. We also observe that the application-centric analysis can explain many of the differences in prefetching effectiveness observed across the studied applications.
Original language | English |
---|---|
Title of host publication | High Performance Computing Systems |
Subtitle of host publication | Performance Modeling, Benchmarking and Simulation - 4th International Workshop, PMBS 2013, Revised Selected Papers |
Editors | Stephen A. Jarvis, Steven A. Wright, Simon D. Hammond |
Publisher | Springer Verlag |
Pages | 115-135 |
Number of pages | 21 |
ISBN (Electronic) | 9783319102139 |
State | Published - 2014 |
Event | 4th International Workshop on Performance Modeling, Benchmarking and Simulation of High-Performance Computing Systems, PMBS 2013 - Denver, United States. Duration: Nov 18 2013 → Nov 18 2013 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 8551 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 4th International Workshop on Performance Modeling, Benchmarking and Simulation of High-Performance Computing Systems, PMBS 2013 |
---|---|
Country/Territory | United States |
City | Denver |
Period | 11/18/13 → 11/18/13 |
Bibliographical note
Publisher Copyright: © Springer International Publishing Switzerland 2014.
Keywords
- Data streaming
- Performance evaluation
- Prefetching