TY - GEN
T1 - Exascale workload characterization and architecture implications
AU - Balaprakash, Prasanna
AU - Buntinas, Darius
AU - Chan, Anthony
AU - Guha, Apala
AU - Gupta, Rinku
AU - Narayanan, Sri Hari Krishna
AU - Chien, Andrew A.
AU - Hovland, Paul
AU - Norris, Boyana
PY - 2013
Y1 - 2013
N2 - We use a hybrid methodology based on binary instrumentation and performance counters to characterize a set of proxy applications (mini-apps and PETSc applications) representative of a broad range of scientific applications (and particularly DOE's future high performance computing workloads). From this empirical basis, we create statistical models that extrapolate application properties (instruction mix, memory size, and memory bandwidth) as a function of problem size. We validate them and project the first quantitative characterization of an exascale computing workload. Finally, the exascale workload is used to evaluate a radical new exascale architecture, stacked DRAM with processor under memory (PUM). Of the two projections, one shows major potential benefits in using PUM. However, the second, more conservative projection suggests that only a small number of exascale applications are likely to be memory-bandwidth limited, but even these are fundamentally memory-capacity limited.
AB - We use a hybrid methodology based on binary instrumentation and performance counters to characterize a set of proxy applications (mini-apps and PETSc applications) representative of a broad range of scientific applications (and particularly DOE's future high performance computing workloads). From this empirical basis, we create statistical models that extrapolate application properties (instruction mix, memory size, and memory bandwidth) as a function of problem size. We validate them and project the first quantitative characterization of an exascale computing workload. Finally, the exascale workload is used to evaluate a radical new exascale architecture, stacked DRAM with processor under memory (PUM). Of the two projections, one shows major potential benefits in using PUM. However, the second, more conservative projection suggests that only a small number of exascale applications are likely to be memory-bandwidth limited, but even these are fundamentally memory-capacity limited.
UR - http://www.scopus.com/inward/record.url?scp=84876839343&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84876839343
SN - 9781627480338
T3 - Simulation Series
SP - 30
EP - 37
BT - Proceedings of the 2013 Spring Simulation Multiconference, SpringSim 2013 - High Performance Computing Symposium, HPC 2013
T2 - High Performance Computing Symposium, HPC 2013, Part of the 2013 Spring Simulation Multiconference, SpringSim 2013
Y2 - 7 April 2013 through 10 April 2013
ER -