From Edge to HPC: Investigating Cross-Facility Data Streaming Architectures

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In this paper, we investigate three cross-facility data streaming architectures: Direct Streaming (DTS), Proxied Streaming (PRS), and Managed Service Streaming (MSS). We examine how they differ in data flow paths and deployment feasibility, and detail their implementation using the Data Streaming to HPC (DS2HPC) architectural framework and the SciStream memory-to-memory streaming toolkit on the production-grade Advanced Computing Ecosystem (ACE) infrastructure at the Oak Ridge Leadership Computing Facility (OLCF). We present a workflow-specific evaluation of these architectures using three synthetic workloads derived from the streaming characteristics of scientific workflows. Through simulated experiments, we measure streaming throughput, round-trip time, and overhead under three messaging patterns commonly found in AI-HPC communication motifs: work sharing, work sharing with feedback, and broadcast and gather. Our study shows that DTS offers a minimal-hop path, resulting in higher throughput and lower latency, whereas MSS provides greater deployment feasibility and scalability across multiple users but incurs significant overhead. PRS lies in between, offering a scalable architecture whose performance matches that of DTS in most cases.
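To make the measured quantities concrete, the following is a minimal in-process sketch of the work sharing with feedback pattern. It is not the paper's benchmark and uses neither SciStream nor a message broker: the function name `run_work_sharing`, its parameters, and the use of Python standard-library queues and threads are all assumptions chosen for illustration. It only shows how throughput and round-trip time could be derived from message timestamps.

```python
import queue
import threading
import time


def run_work_sharing(num_consumers=4, num_messages=1000, payload_bytes=1024):
    """Hypothetical helper: push messages through one shared queue and
    report throughput and mean round-trip time as seen by the producer."""
    work = queue.Queue()      # shared queue: each message goes to exactly one consumer
    feedback = queue.Queue()  # replies flow back to the producer
    payload = b"x" * payload_bytes

    def consumer():
        while True:
            item = work.get()
            if item is None:  # sentinel: stop this consumer
                break
            msg_id, sent_at, _data = item
            # Feedback step: acknowledge the message to the producer.
            feedback.put((msg_id, sent_at))

    threads = [threading.Thread(target=consumer) for _ in range(num_consumers)]
    for t in threads:
        t.start()

    start = time.perf_counter()
    for msg_id in range(num_messages):
        work.put((msg_id, time.perf_counter(), payload))

    rtts = []
    seen = set()
    for _ in range(num_messages):
        msg_id, sent_at = feedback.get()
        # Round-trip time includes any time the reply waited in the feedback queue.
        rtts.append(time.perf_counter() - sent_at)
        seen.add(msg_id)
    elapsed = time.perf_counter() - start

    # Shut down: one sentinel per consumer, then wait for them to exit.
    for _ in threads:
        work.put(None)
    for t in threads:
        t.join()

    return {
        "messages": len(seen),
        "throughput_msgs_per_s": num_messages / elapsed,
        "mean_rtt_s": sum(rtts) / len(rtts),
    }
```

Calling `run_work_sharing(num_consumers=2, num_messages=100)` returns a dictionary with the message count, throughput, and mean round-trip time. The absolute numbers reflect in-process queues only and say nothing about cross-facility network paths.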

Original language: English
Title of host publication: Proceedings of 2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops
Publisher: Association for Computing Machinery, Inc
Pages: 949-959
Number of pages: 11
ISBN (Electronic): 9798400718717
State: Published - Nov 15 2025
Event: 2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops - St. Louis, United States
Duration: Nov 16 2025 to Nov 21 2025

Publication series

Name: Proceedings of 2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops

Conference

Conference: 2025 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, SC 2025 Workshops
Country/Territory: United States
City: St. Louis
Period: 11/16/25 to 11/21/25

Funding

We thank our colleagues Nick Schmitt, Ethan O’Dell, Steven Lu, Tyler Skluzacek, A.J. Ruckman, Paul Bryant, and Gustav Jansen at ORNL for their help in resolving technical challenges during the deployment of the streaming architectures and for providing valuable information and clarifications. We also thank the SciStream team at ANL, Flavio Castro and Rajkumar Kettimuthu, for helping us better understand SciStream’s capabilities. This research used resources of the Oak Ridge Leadership Computing Facility at Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.

Keywords

  • Data streaming
  • HPC
  • Integrated Research Infrastructure (IRI)
  • Latency
  • Proxy
  • RabbitMQ
  • SciStream
  • Scientific workflows
  • Streaming service
  • Throughput
