Investigating operating system noise in extreme-scale high-performance computing systems using simulation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Scopus citations

Abstract

Hardware/software co-design for future-generation highperformance computing (HPC) systems aims at closing the gap between the peak capabilities of the hardware and the performance realized by applications (applicationarchitecture performance gap). Performance profiling of architectures and applications is a crucial part of this iterative process. The work in this paper focuses on operating system (OS) noise as an additional factor to be considered for co-design. It represents the first step in including OS noise in HPC hardware/software co-design by adding a noise injection feature to an existing simulation-based co-design toolkit. It reuses an existing abstraction for OS noise with frequency (periodic recurrence) and period (duration of each occurrence) to enhance the processor model of the Extreme-scale Simulator (xSim) with synchronized and random OS noise simulation. The results demonstrate this capability by evaluating the impact of OS noise on MPI Bcast() and MPI Reduce() in a simulated futuregeneration HPC system with 2,097,152 compute nodes.

Original languageEnglish
Title of host publicationIASTED Multiconferences - Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2013
Pages670-677
Number of pages8
DOIs
StatePublished - 2013
Event11th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2013 - Innsbruck, Austria
Duration: Feb 11 2013Feb 13 2013

Publication series

NameIASTED Multiconferences - Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2013

Conference

Conference11th IASTED International Conference on Parallel and Distributed Computing and Networks, PDCN 2013
Country/TerritoryAustria
CityInnsbruck
Period02/11/1302/13/13

Keywords

  • High-performance computing
  • Operating system noise
  • Parallel discrete event simulation
  • Performance evaluation

Fingerprint

Dive into the research topics of 'Investigating operating system noise in extreme-scale high-performance computing systems using simulation'. Together they form a unique fingerprint.

Cite this