Out-of-core wavefront computations with reduced synchronization

Pierre Nicolas Clauss, Jens Gustedt, Frédéric Suter

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Matrix computation algorithms often exhibit dependencies between neighboring elements inside loop nests such that the frontier between computed elements and those to be computed wanders inform of a 'wave' through the matrix. Macro-pipelining techniques can achieve an efficient parallelization of such algorithms by overlapping communication and computation. Usually these techniques are limited to situations where all the data to be processed fits into main memory, whereas for larger data the I/O usage pattern for external storage requires special attention. The work [5] presented a first extension of the wavefront framework to these so-called out-of-core problems. The present paper proposes a redesign of their algorithm that minimizes both overhead and perturbations coming from communications. To tackle the issue of non-contiguous I/O, we also propose an optimized data layout. These two major modifications of the original algorithm eventually allow us to present a third improvement as our implementation shortens the transition phase between two consecutive iterations of the wavefront algorithm. Experiments performed with the PARXXL library show that we can significantly reduce the time lost during inefficient I/O operations and thus obtain faster computations.

Original languageEnglish
Title of host publicationProceedings of the 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing, PDP 2008
Pages293-300
Number of pages8
DOIs
StatePublished - 2008
Externally publishedYes
Event16th Euromicro Conference on Parallel, Distributed and Network-Based Processing, PDP 2008 - Toulouse, France
Duration: Feb 13 2008Feb 15 2008

Publication series

NameProceedings of the 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing, PDP 2008

Conference

Conference16th Euromicro Conference on Parallel, Distributed and Network-Based Processing, PDP 2008
Country/TerritoryFrance
CityToulouse
Period02/13/0802/15/08

Fingerprint

Dive into the research topics of 'Out-of-core wavefront computations with reduced synchronization'. Together they form a unique fingerprint.

Cite this