Enabling High- Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring

Jeremy J. Williams, Daniel Medeiros, Stefan Costea, David Tskhakaya, Franz Poeschel, Rene Widera, Axel Huebl, Scott Klasky, Norbert Podhorszki, Leon Kos, Ales Podolnik, Jakub Hromadka, Tapish Narwal, Klaus Steiniger, Michael Bussmann, Erwin Laure, Stefano Markidis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Large-scale HPC simulations of plasma dynamics in fusion devices require efficient parallel I/O to avoid slowing down the simulation and to enable the post-processing of critical information. Such complex simulations lacking parallel I/O capabilities may encounter performance bottlenecks, hindering their effectiveness in data-intensive computing tasks. In this work, we focus on introducing and enhancing the efficiency of parallel I/O operations in Particle-in-Cell Monte Carlo simu-lations. We first evaluate the scalability of BIT1, a massively-parallel electrostatic PIC MC code, determining its initial write throughput capabilities and performance bottlenecks using an HPC I/O performance monitoring tool, Darshan. We design and develop an adaptor to the openPMD I/O interface that allows us to stream PIC particle and field information to I/O using the BP4 backend, aggressively optimized for I/O efficiency, including the highly efficient ADIOS2 interface. Next, we explore advanced optimization techniques such as data compression, aggregation, and Lustre file striping, achieving write throughput improvements while enhancing data storage efficiency. Finally, we analyze the enhanced high-throughput parallel I/O and storage capabilities achieved through the integration of openPMD with rapid metadata extraction in BP4 format. Our study demonstrates that the integration of openPMD and advanced I/O optimizations significantly enhances BIT1's I/O performance and storage capabilities, successfully introducing high throughput parallel I/O and surpassing the capabilities of traditional file I/O.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Cluster Computing Workshops, CLUSTER Workshops 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages86-95
Number of pages10
ISBN (Electronic)9798350383454
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Cluster Computing Workshops, CLUSTER Workshops 2024 - Kobe, Japan
Duration: Sep 24 2024Sep 27 2024

Publication series

NameProceedings - 2024 IEEE International Conference on Cluster Computing Workshops, CLUSTER Workshops 2024

Conference

Conference2024 IEEE International Conference on Cluster Computing Workshops, CLUSTER Workshops 2024
Country/TerritoryJapan
CityKobe
Period09/24/2409/27/24

Keywords

  • ADIOS2
  • Darshan
  • Distributed Storage
  • Efficient Data Processing
  • Large-Scale PIC Simulations
  • openPMD
  • Parallel I/O

Fingerprint

Dive into the research topics of 'Enabling High- Throughput Parallel I/O in Particle-in-Cell Monte Carlo Simulations with openPMD and Darshan I/O Monitoring'. Together they form a unique fingerprint.

Cite this