TY - GEN
T1 - Improving the accuracy and efficiency of time-independent trace replay
AU - Desprez, Frederic
AU - Markomanolis, George S.
AU - Suter, Frederic
PY - 2012
Y1 - 2012
AB - Simulation is a popular approach for obtaining objective performance indicators on platforms that are not at one's disposal. It can help in dimensioning compute clusters in large computing centers. In previous work, we proposed a framework for the off-line simulation of MPI applications. Its main originality with regard to the literature is its reliance on time-independent execution traces. This allows us to completely decouple the acquisition process from the actual replay of the traces in a simulation context. We are then able to acquire traces for large application instances without being limited to an execution on a single compute cluster. Finally, our framework is built on top of a scalable, fast, and validated simulation kernel. In this paper, we detail the performance issues that we encountered with the first implementation of our trace replay framework. We propose several modifications to address these issues and analyze their impact. Results show a clear improvement in accuracy and efficiency over the initial implementation.
KW - MPI
KW - Performance prediction
KW - Simulation
UR - http://www.scopus.com/inward/record.url?scp=84876549768&partnerID=8YFLogxK
U2 - 10.1109/SC.Companion.2012.64
DO - 10.1109/SC.Companion.2012.64
M3 - Conference contribution
AN - SCOPUS:84876549768
SN - 9780769549569
T3 - Proceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012
SP - 446
EP - 455
BT - Proceedings - 2012 SC Companion
T2 - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012
Y2 - 10 November 2012 through 16 November 2012
ER -