Just in time: Adding value to the IO pipelines of high performance applications with JITStaging

Hasan Abbasi, Greg Eisenhauer, Matthew Wolf, Karsten Schwan, Scott Klasky

    Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

    56 Scopus citations

    Abstract

    Large scale applications are generating a tsunami of data, with understanding driven by finding information hidden within this data. The ever-increasing sizes of output, however, are making it difficult for science users to inspect the data generated by their applications, understand its important properties, and/or organize it for subsequent analysis and visualization. This paper presents JITStager, a software infrastructure with which end users can dynamically customize and thus, add value to the output pipelines of their HEC applications. JITStager is able to customize data at scale, by leveraging the computational power of both compute nodes and of additional 'data staging' nodes allocated by end users. Using existing, componentized I/O interfaces to decouple the compile-time specification of the program and the run-time customization of the data pipeline, JITStager employs efficient runtime methods for binary code generation and data movement to create custom pipelines for applications' output processes that provide end users with improved insights into the data being produced, without burdening the application's computational performance and without impeding output performance. This paper describes the JITStager architecture, evaluates its performance, and demonstrates the advantages derived from its use with representative HPC applications.

    Original language: English
    Title of host publication: HPDC'11 - Proceedings of the 20th International Symposium on High Performance Distributed Computing
    Pages: 27-36
    Number of pages: 10
    State: Published - 2011
    Event: 20th ACM International Symposium on High-Performance Parallel and Distributed Computing, HPDC'11 - San Jose, CA, United States
    Duration: Jun 8 2011 to Jun 11 2011

    Publication series

    Name: Proceedings of the IEEE International Symposium on High Performance Distributed Computing
    ISSN (Print): 1082-8907

    Conference

    Conference: 20th ACM International Symposium on High-Performance Parallel and Distributed Computing, HPDC'11
    Country/Territory: United States
    City: San Jose, CA
    Period: 06/8/11 to 06/11/11

    Keywords

    • datatap
    • gtc
    • i/o
    • infiniband
    • staging
    • warp
    • xt3
    • xt4
