
Challenges for Monitoring and Data Analytics in a Leadership Public Data Repository

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

1 Scopus citation

Abstract

The availability and disposition of data have assumed increasing importance in large-scale computational science. Data repositories are evolving to meet new classes of requirements: compliance with government access guidelines, support for reproducibility of experimental results, and long-term availability of data products. The Constellation public data repository at the Oak Ridge Leadership Computing Facility (OLCF) faces these issues while being situated in one of the most productive data centers in the world. While monitoring and operational data analysis are ingrained in the operation of the OLCF’s large-scale high performance computing platforms, data repositories do not have this history of support. Problems faced by Constellation range from data size (over 7 petabytes in current holdings) to analytic complexity (detailed curation is both absolutely necessary for many data sets and absolutely impossible for humans to accomplish in any practical manner) to deployment environment (OLCF storage resources are oriented toward the needs of the compute platforms). In this paper we describe some of the challenges of collecting monitoring and analytic data from a leadership public data repository. We also discuss various strategies we are pursuing to address these challenges, from manual data collection to plans for introducing machine learning-based curatorial techniques.

Original language: English
Title of host publication: High Performance Computing. ISC High Performance 2024 International Workshops, Revised Selected Papers
Editors: Michèle Weiland, Sarah Neuwirth, Carola Kruse, Tobias Weinzierl
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 287-292
Number of pages: 6
ISBN (Print): 9783031737152
State: Published - 2025
Event: 39th ISC High Performance conference, ISC-HPC 2024 - Hamburg, Germany
Duration: May 12 2024 to May 16 2024

Publication series

Name: Lecture Notes in Computer Science
Volume: 15058 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 39th ISC High Performance conference, ISC-HPC 2024
Country/Territory: Germany
City: Hamburg
Period: 05/12/24 to 05/16/24

Funding

This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.
