TY - GEN
T1 - Retrospect
T2 - 14th European PVM/MPI Users' Group Meeting on Parallel Virtual Machine and Message Passing Interface
AU - Bouteiller, Aurelien
AU - Bosilca, George
AU - Dongarra, Jack
PY - 2007
Y1 - 2007
N2 - While high performance computing was eagerly adopted by users as a vehicle for satisfying a growing demand on computational power, some areas are still poorly explored. The MPI paradigm is considered as being the keystone for the large development of the HPC infrastructure over the last decade. However, even today the users have to face the lack of tools able to help increase the stability of the software stack and/or of the applications. In this paper we present and evaluate a tool designed to allow developers to further investigate the execution of parallel applications by enabling them to dynamically move back and forth in the execution timeline of a parallel application. Based on an unobtrusive message logging mechanism, deterministic replay is enforced, leading to a simpler and more efficient way to debug parallel software.
AB - While high performance computing was eagerly adopted by users as a vehicle for satisfying a growing demand on computational power, some areas are still poorly explored. The MPI paradigm is considered as being the keystone for the large development of the HPC infrastructure over the last decade. However, even today the users have to face the lack of tools able to help increase the stability of the software stack and/or of the applications. In this paper we present and evaluate a tool designed to allow developers to further investigate the execution of parallel applications by enabling them to dynamically move back and forth in the execution timeline of a parallel application. Based on an unobtrusive message logging mechanism, deterministic replay is enforced, leading to a simpler and more efficient way to debug parallel software.
UR - https://www.scopus.com/pages/publications/38449105996
U2 - 10.1007/978-3-540-75416-9_41
DO - 10.1007/978-3-540-75416-9_41
M3 - Conference contribution
AN - SCOPUS:38449105996
SN - 9783540754152
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 297
EP - 306
BT - Recent Advances in Parallel Virtual Machine and Message Passing Interface - 14th European PVM/MPI Users' Group Meeting, Proceedings
PB - Springer Verlag
Y2 - 30 September 2007 through 3 October 2007
ER -