Dodging the cost of unavoidable memory copies in message logging protocols

George Bosilca, Aurelien Bouteiller, Thomas Herault, Pierre Lemarinier, Jack J. Dongarra

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Scopus citations

Abstract

With the number of computing elements spiraling to hundreds of thousands in modern HPC systems, failures are common events. Few applications are nevertheless fault tolerant; most are in need of a seamless recovery framework. Among the automatic fault-tolerant techniques proposed for MPI, message logging is preferable for its scalable recovery. The major challenge for message logging protocols is the performance penalty on communications during failure-free periods, mostly coming from the payload copy introduced for each message. In this paper, we investigate different approaches for logging payload and compare their impact on network performance.

Original language: English
Title of host publication: Recent Advances in the Message Passing Interface - 17th European MPI Users' Group Meeting, EuroMPI 2010, Proceedings
Pages: 189-197
Number of pages: 9
State: Published - 2010
Event: 17th European MPI Users' Group Meeting, EuroMPI 2010 - Stuttgart, Germany
Duration: Sep 12 2010 to Sep 15 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 6305 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 17th European MPI Users' Group Meeting, EuroMPI 2010
Country/Territory: Germany
City: Stuttgart
Period: 09/12/10 to 09/15/10
