Abstract
Parallel application benchmarks are indispensable for evaluating/optimizing HPC software and hardware. However, it is very challenging and costly to obtain high-fidelity benchmarks reecting the scale and complexity of state-of-the-art parallel applications. Hand-extracted synthetic benchmarks are time- and labor-intensive to create. Real applications themselves, while offering most accurate performance evaluation, are expensive to compile, port, reconfigure, and often plainly inaccessible due to security or ownership concerns. This work contributes APPrime, a novel tool for tracebased automatic parallel benchmark generation. Taking as input standard communication-I/O traces of an application's execution, it couples accurate automatic phase identification with statistical regeneration of event parameters to create compact, portable, and to some degree reconfigurable parallel application benchmarks. Experiments with four NAS Parallel Benchmarks (NPB) and three real scientific simulation codes confirm the fidelity of APPrime benchmarks. They retain the original applications' performance characteristics, in particular their relative performance across platforms. Also, the result benchmarks, already released online, are much more compact and easy-toport compared to the original applications.
Original language | English |
---|---|
Pages (from-to) | 309-320 |
Number of pages | 12 |
Journal | Performance Evaluation Review |
Volume | 43 |
Issue number | 1 |
DOIs | |
State | Published - Jun 24 2015 |
Event | ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS 2015 - Portland, United States Duration: Jun 15 2015 → Jun 19 2015 |
Funding
We thank the anonymous reviewers for their valuable com-ments and suggestions. The work is facilitated by QCRI''s in-ternship program. It is also partially supported by research grants at involved institutes: NSF awards CCF-0937908, CCF-0937690, and CNS-1318564 (NCSU), DOE O_ce of Science, O_ce of Advanced Scienti_c Computing Research, SciDAC program, and Oak Ridge Leadership Computing Facility (ORNL), and the National High-Tech Research and Development Plan (863 project) 2012AA01A302 and NSFC project 61472201 (Tsinghua University).
Funders | Funder number |
---|---|
DOE O_ce of Science | |
Oak Ridge National Laboratory | ORNL |
QCRI | |
National Sleep Foundation | CNS-1318564, NCSU, CCF-0937908, CCF-0937690 |
National Natural Science Foundation of China | 61472201 |
Tsinghua University | |
National High-tech Research and Development Program | 2012AA01A302 |
Keywords
- Asynchronous I/O
- Benchmark generation
- HPC applications
- Markov chain model
- Phase identification
- Traces