TY - GEN
T1 - Assessing the performance of MPI applications through time-independent trace replay
AU - Desprez, Frédéric
AU - Markomanolis, George S.
AU - Quinson, Martin
AU - Suter, Frédéric
PY - 2011
Y1 - 2011
N2 - Simulation is a popular approach to obtain objective performance indicators platforms that are not at one's disposal. It may help the dimensioning of compute clusters in large computing centers. In this work we present a framework for the off-line simulation of MPI applications. Its main originality with regard to the literature is to rely on time-independent execution traces. This allows us to completely decouple the acquisition process from the actual replay of the traces in a simulation context. Then we are able to acquire traces for large application instances without being limited to an execution on a single compute cluster. Finally our framework is built on top of a scalable, fast, and validated simulation kernel. In this paper, we introduce the used time-independent trace format, investigate several acquisition strategies, detail the developed trace replay tool, and assess the quality of our simulation framework in terms of accuracy, acquisition time, simulation time, and trace size.
AB - Simulation is a popular approach to obtain objective performance indicators platforms that are not at one's disposal. It may help the dimensioning of compute clusters in large computing centers. In this work we present a framework for the off-line simulation of MPI applications. Its main originality with regard to the literature is to rely on time-independent execution traces. This allows us to completely decouple the acquisition process from the actual replay of the traces in a simulation context. Then we are able to acquire traces for large application instances without being limited to an execution on a single compute cluster. Finally our framework is built on top of a scalable, fast, and validated simulation kernel. In this paper, we introduce the used time-independent trace format, investigate several acquisition strategies, detail the developed trace replay tool, and assess the quality of our simulation framework in terms of accuracy, acquisition time, simulation time, and trace size.
KW - Message passing interface
KW - Off-line simulation
KW - Performance prediction
UR - http://www.scopus.com/inward/record.url?scp=80155190351&partnerID=8YFLogxK
U2 - 10.1109/ICPPW.2011.33
DO - 10.1109/ICPPW.2011.33
M3 - Conference contribution
AN - SCOPUS:80155190351
SN - 9780769545110
T3 - Proceedings of the International Conference on Parallel Processing Workshops
SP - 467
EP - 476
BT - Proceedings - 2011 International Conference on Parallel Processing Workshops, ICPPW 2011
T2 - 2011 International Conference on Parallel Processing Workshops, ICPPW 2011
Y2 - 13 September 2011 through 16 September 2011
ER -