Abstract
The path to extreme scale high-performance computing (HPC) poses several challenges related to power, performance, resilience, productivity, programmability, data movement, and data management. Investigating the performance of parallel applications at scale on future architectures and the performance impact of different architectural choices is an important component of HPC hardware/software co-design. Simulations using models of future HPC systems and communication traces from applications running on existing HPC systems can offer an insight into the performance of future architectures. This work targets technology developed for scalable application tracing of communication events. It focuses on extreme-scale simulation of HPC applications and their communication behavior via lightweight parallel discrete event simulation for performance estimation and evaluation. Instead of simply replaying a trace within a simulator, this work promotes the generation of a benchmark from traces. This benchmark is subsequently exposed to simulation using models to reflect the performance characteristics of future-generation HPC systems. This technique provides a number of benefits, such as eliminating the data intensive trace replay and enabling simulations at different scales. The presented work features novel software co-design aspects, combining the ScalaTrace tool to generate scalable trace files, the ScalaBenchGen tool to generate the benchmark, and the xSim tool to assess the benchmark characteristics within a simulator.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 IEEE/ACM 20th International Symposium on Distributed Simulation and Real Time Applications, DS-RT 2016 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 9-18 |
Number of pages | 10 |
ISBN (Electronic) | 9781509035045 |
DOIs | |
State | Published - Dec 16 2016 |
Event | 20th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications, DS-RT 2016 - London, United Kingdom Duration: Sep 21 2016 → Sep 23 2016 |
Publication series
Name | Proceedings - IEEE International Symposium on Distributed Simulation and Real-Time Applications, DS-RT |
---|---|
ISSN (Print) | 1550-6525 |
Conference
Conference | 20th IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications, DS-RT 2016 |
---|---|
Country/Territory | United Kingdom |
City | London |
Period | 09/21/16 → 09/23/16 |
Bibliographical note
Publisher Copyright:© 2016 IEEE.
Keywords
- Performance modeling
- application simulation
- application tracing
- high-performance computing