Just in time: Adding value to the IO pipelines of high performance applications with JITStaging

Hasan Abbasi, Greg Eisenhauer, Matthew Wolf, Karsten Schwan, Scott Klasky

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

56 Scopus citations

Abstract

Large scale applications are generating a tsunami of data, with understanding driven by finding information hidden within this data. The ever-increasing sizes of output, however, are making it difficult for science users to inspect the data generated by their applications, understand its important properties, and/or organize it for subsequent analysis and visualization. This paper presents JITStager, a software infrastructure with which end users can dynamically customize and thus, add value to the output pipelines of their HEC applications. JITStager is able to customize data at scale, by leveraging the computational power of both compute nodes and of additional 'data staging' nodes allocated by end users. Using existing, componentized I/O interfaces to decouple the compile-time specification of the program and the run-time customization of the data pipeline, JITStager employs efficient runtime methods for binary code generation and data movement to create custom pipelines for applications' output processes that provide end users with improved insights into the data being produced, without burdening the application's computational performance and without impeding output performance. This paper describes the JITStager architecture, evaluates its performance, and demonstrates the advantages derived from its use with representative HPC applications.

Original languageEnglish
Title of host publicationHPDC'11 - Proceedings of the 20th International Symposium on High Performance Distributed Computing
Pages27-36
Number of pages10
DOIs
StatePublished - 2011
Event20th ACM International Symposium on High-Performance Parallel and Distributed Computing, HPDC'11 - San Jose, CA, United States
Duration: Jun 8 2011Jun 11 2011

Publication series

NameProceedings of the IEEE International Symposium on High Performance Distributed Computing
ISSN (Print)1082-8907

Conference

Conference20th ACM International Symposium on High-Performance Parallel and Distributed Computing, HPDC'11
Country/TerritoryUnited States
CitySan Jose, CA
Period06/8/1106/11/11

Keywords

  • datatap
  • gtc
  • i/o
  • infiniband
  • staging
  • warp
  • xt3
  • xt4

Fingerprint

Dive into the research topics of 'Just in time: Adding value to the IO pipelines of high performance applications with JITStaging'. Together they form a unique fingerprint.

Cite this