X-composer: Enabling cross-environments in-situ workflows between HPC and cloud

Feng Li, Dali Wang, Feng Yan, Fengguang Song

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As large-scale scientific simulations and big data analyses become more popular, it is increasingly more expensive to store huge amounts of raw simulation results to perform post-analysis. To minimize the expensive data I/O, "in-situ"analysis is a promising approach, where data analysis applications analyze the simulation generated data on the fly without storing it first. However, it is challenging to organize, transform, and transport data at scales between two semantically different ecosystems due to the distinct software and hardware difference. To tackle these challenges, we design and implement the X-Composer framework. X-Composer connects cross-ecosystem applications to form an "in-situ"scientific workflow, and provides a unified approach and recipe for supporting such hybrid in-situ workflows on distributed heterogeneous resources. X-Composer reorganizes simulation data as continuous data streams and feeds them seamlessly into the Cloud-based stream processing services to minimize I/O overheads. For evaluation, we use X-Composer to set up and execute a cross-ecosystem workflow, which consists of a parallel Computational Fluid Dynamics simulation running on HPC, and a distributed Dynamic Mode Decomposition analysis application running on Cloud. Our experimental results show that X-Composer can seamlessly couple HPC and Big Data jobs in their own native environments, achieve good scalability, and provide high-fidelity analytics for ongoing simulations in real-time.

Original languageEnglish
Title of host publicationProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2021
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450385633
DOIs
StatePublished - Jul 5 2021
Event2021 Platform for Advanced Scientific Computing Conference, PASC 2021 - Virtual, Online, Switzerland
Duration: Jul 5 2021Jul 9 2021

Publication series

NameProceedings of the Platform for Advanced Scientific Computing Conference, PASC 2021

Conference

Conference2021 Platform for Advanced Scientific Computing Conference, PASC 2021
Country/TerritorySwitzerland
CityVirtual, Online
Period07/5/2107/9/21

Funding

This research is supported by the NSF award #1835817 and the DOE contract #DE-AC05-00OR22725. This work also used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by NSF grant number ACI-1548562.

Keywords

  • Cloud computing
  • HPC
  • In-situ data analysis
  • Scientific workflows

Fingerprint

Dive into the research topics of 'X-composer: Enabling cross-environments in-situ workflows between HPC and cloud'. Together they form a unique fingerprint.

Cite this