DataSpaces: An interaction and coordination framework for coupled simulation workflows

Ciprian Docan, Manish Parashar, Scott Klasky

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    99 Scopus citations

    Abstract

    Emerging high-performance distributed computing environments are enabling new end-to-end formulations in science and engineering that involve multiple interacting processes and data-intensive application workflows. For example, current fusion simulation efforts are exploring coupled models and codes that simultaneously simulate separate application processes, such as the core and the edge turbulence, and run on different high performance computing resources. These components need to interact, at runtime, with each other and with services for data monitoring, data analysis and visualization, and data archiving. As a result, they require efficient support for dynamic and flexible couplings and interactions, which remains a challenge. This paper presents Data-Spaces, a flexible interaction and coordination substrate that addresses this challenge. DataSpaces essentially implements a semantically specialized virtual shared space abstraction that can be associatively accessed by all components and services in the application workflow. It enables live data to be extracted from running simulation components, indexes this data online, and then allows it to be monitored, queried and accessed by other components and services via the space using semantically meaningful operators. The underlying data transport is asynchronous, low-overhead and largely memory-to-memory. The design, implementation, and experimental evaluation of DataSpaces using a coupled fusion simulation workflow is presented.

    Original languageEnglish
    Title of host publicationHPDC 2010 - Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
    Pages25-36
    Number of pages12
    DOIs
    StatePublished - 2010
    Event19th ACM International Symposium on High Performance Distributed Computing, HPDC 2010 - Chicago, IL, United States
    Duration: Jun 21 2010Jun 25 2010

    Publication series

    NameHPDC 2010 - Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing

    Conference

    Conference19th ACM International Symposium on High Performance Distributed Computing, HPDC 2010
    Country/TerritoryUnited States
    CityChicago, IL
    Period06/21/1006/25/10

    Keywords

    • Code coupling
    • Data redistribution
    • I/O
    • RDMA
    • Workflows

    Fingerprint

    Dive into the research topics of 'DataSpaces: An interaction and coordination framework for coupled simulation workflows'. Together they form a unique fingerprint.

    Cite this