Trace-based performance analysis for the petascale simulation code FLASH

Heike Jagode, Andreas Knüpfer, Jack Dongarra, Matthias Jurenz, Matthias S. Müller, Wolfgang E. Nagel

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Performance analysis of applications on modern high-end petascale systems is increasingly challenging due to the rising complexity and quantity of the computing units. This paper presents a performance-analysis study using the Vampir performance-analysis tool suite, which examines application behavior as well as the fundamental system properties. This study was carried out on the Jaguar system at Oak Ridge National Laboratory, the fastest computer on the November 2009 Top500 list. We analyzed the FLASH simulation code that is designed to be scaled with tens of thousands of CPU cores, which means that using existing performance-analysis tools is very complex. The study reveals two classes of performance problems that are relevant for very high CPU counts: MPI communication and scalable I/O. For both, solutions are presented and verified. Finally, the paper proposes improvements and extensions for event tracing tools in order to allow scalability of the tools towards higher degrees of parallelism.

Original languageEnglish
Pages (from-to)428-439
Number of pages12
JournalInternational Journal of High Performance Computing Applications
Volume25
Issue number4
DOIs
StatePublished - Nov 2011
Externally publishedYes

Funding

This research was sponsored by the Office of Mathematical, Information, and Computational Sciences of the Office of Science, US Department of Energy, under Contract No. DE-AC05-00OR22725 with UT-Battelle, LLC. This work used resources of the National Center for Computational Sciences at Oak Ridge National Laboratory, which is supported by the Office of Science of the Department of Energy under Contract DE-AC05-00OR22725. This resource was made available via the Performance Evaluation and Analysis Consortium End Station, a Department of Energy INCITE project.

FundersFunder number
Office of Science of the Department of Energy
U.S. Department of EnergyDE-AC05-00OR22725
Office of Science

    Keywords

    • Vampir
    • collective I/O
    • collective MPI operations
    • event tracing
    • libNBC

    Fingerprint

    Dive into the research topics of 'Trace-based performance analysis for the petascale simulation code FLASH'. Together they form a unique fingerprint.

    Cite this