Designing and implementing lightweight kernels for capability computing

Rolf Riesen, Ron Brightwell, Patrick G. Bridges, Trammell Hudson, Arthur B. Maccabe, Patrick M. Widener, Kurt Ferreira

Research output: Contribution to journalArticlepeer-review

35 Scopus citations

Abstract

In the early 1990s, researchers at Sandia National Laboratories and the University of New Mexico began development of customized system software for massively parallel 'capability' computing platforms. These lightweight kernels have proven to be essential for delivering the full power of the underlying hardware to applications. This claim is underscored by the success of several supercomputers, including the Intel Paragon, Intel Accelerated Strategic Computing Initiative Red, and the Cray XT series of systems, each having established a new standard for high-performance computing upon introduction. In this paper, we describe our approach to lightweight compute node kernel design and discuss the design principles that have guided several generations of implementation and deployment. A broad strategy of operating system specialization has led to a focus on user-level resource management, deterministic behavior, and scalable system services. The relative importance of each of these areas has changed over the years in response to changes in applications and hardware and system architecture. We detail our approach and the associated principles, describe how our application of these principles has changed over time, and provide design and performance comparisons to contemporaneous supercomputing operating systems.

Original languageEnglish
Pages (from-to)793-817
Number of pages25
JournalConcurrency and Computation: Practice and Experience
Volume21
Issue number6
DOIs
StatePublished - Apr 25 2009
Externally publishedYes

Keywords

  • Operating systems
  • Parallel computing

Fingerprint

Dive into the research topics of 'Designing and implementing lightweight kernels for capability computing'. Together they form a unique fingerprint.

Cite this