TY - GEN
T1 - Understanding Performance Interference in Next-Generation HPC Systems
AU - Mondragon, Oscar H.
AU - Bridges, Patrick G.
AU - Levy, Scott
AU - Ferreira, Kurt B.
AU - Widener, Patrick
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/7/2
Y1 - 2016/7/2
AB - Next-generation systems face a wide range of new potential sources of application interference, including resilience actions, system software adaptation, and in situ analytics programs. In this paper, we present a new model for analyzing the performance of bulk-synchronous HPC applications based on the use of extreme value theory. After validating this model against both synthetic and real applications, the paper then uses both simulation and modeling techniques to profile next-generation interference sources and characterize their behavior and performance impact on a selection of HPC benchmarks, mini-applications, and applications. Lastly, this work shows how the model can be used to understand how current interference mitigation techniques in multi-processors work.
UR - http://www.scopus.com/inward/record.url?scp=85017239152&partnerID=8YFLogxK
U2 - 10.1109/SC.2016.32
DO - 10.1109/SC.2016.32
M3 - Conference contribution
AN - SCOPUS:85017239152
T3 - International Conference for High Performance Computing, Networking, Storage and Analysis, SC
SP - 384
EP - 395
BT - Proceedings of SC 2016
PB - IEEE Computer Society
T2 - 2016 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2016
Y2 - 13 November 2016 through 18 November 2016
ER -