SCISPACE: A scientific collaboration workspace for geo-distributed HPC data centers

Awais Khan, Taeuk Kim, Hyunki Byun, Youngjae Kim

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

Future terabit networks promise to dramatically improve the movement of big data between geographically dispersed HPC data centers. The scientific community takes advantage of terabit networks such as DOE's ESnet, accelerating the trend toward a small world of collaboration between geo-distributed HPC data centers. Such networks improve information and resource sharing for joint simulation and analysis between the HPC data centers. However, effective collaboration faces several challenges: a collective view of multi-site shared data, minimal performance degradation of scientific applications running in such collaboration environments and, most critical of all, data sharing policies for such collaborations. In this paper, we propose SCISPACE, a Scientific Collaboration Workspace for collaborative data centers. It provides a global view of information shared from multiple geo-distributed HPC data centers under a single workspace. SCISPACE supports native data access to achieve high performance when data must be read or written in the native data center namespace. This is accomplished by integrating an on-demand metadata export protocol. To optimize scientific collaborations across HPC data centers, SCISPACE implements a search and discovery service. To evaluate SCISPACE, we configured two small-scale geo-distributed HPC data centers, each equipped with LustreFS and connected via a high-speed InfiniBand network similar to terabit networks such as DOE's ESnet. We show the feasibility of SCISPACE using real scientific datasets and applications. The evaluation results show an average 36% performance improvement when the proposed native data access is employed in collaborations. We also emulate a real climate science collaboration to validate the usefulness of SCISPACE.

Original language: English
Pages (from-to): 398-409
Number of pages: 12
Journal: Future Generation Computer Systems
Volume: 101
State: Published - Dec 2019
Externally published: Yes

Funding

This work was supported by a National Research Foundation of Korea (NRF) grant funded by the Korea government (Ministry of Science and ICT) under Grant 2018R1A1A1A05079398, and by an Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korea government (MSIT) (No. 2015-0-00590, High Performance Big Data Analytics Platform Performance Acceleration Technologies Development).

Funders (funder number)
Institute for Information & communication Technology
Ministry of Science and ICT
National Research Foundation of Korea (2018R1A1A1A05079398)
Institute for Information and Communications Technology Promotion (2015-0-00590)
Ministry of Science and ICT, South Korea

    Keywords

    • File systems
    • Geo-distributed data centers
    • Scientific collaborations
