Abstract
Science applications frequently produce and consume large volumes of data, but delivering this data to and from compute resources can be challenging, as parallel file system performance is not keeping up with compute and memory performance. To mitigate this I/O bottleneck, some systems have deployed burst buffers, but their impact on performance for real-world workflow applications is not always clear. In this paper, we examine the impact of burst buffers through the remote-shared, allocatable burst buffers on the Cori system at NERSC. By running a subset of the SCEC CyberShake workflow, a production seismic hazard analysis workflow, we find that using burst buffers offers read and write improvements of about an order of magnitude, and these improvements lead to increased job performance, even for long-running CPU-bound jobs.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of WORKS 2017 |
| Subtitle of host publication | 12th Workshop on Workflows in Support of Large-Scale Science - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis |
| Publisher | Association for Computing Machinery, Inc |
| ISBN (Print) | 9781450351294 |
| DOIs | |
| State | Published - Nov 12 2017 |
| Externally published | Yes |
| Event | 12th Workshop on Workflows in Support of Large-Scale Science, WORKS 2017 - Held in conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017 - Denver, United States Duration: Nov 12 2017 → Nov 17 2017 |
Publication series
| Name | Proceedings of WORKS 2017: 12th Workshop on Workflows in Support of Large-Scale Science - Held in conjunction with SC 2017: The International Conference for High Performance Computing, Networking, Storage and Analysis |
|---|
Conference
| Conference | 12th Workshop on Workflows in Support of Large-Scale Science, WORKS 2017 - Held in conjunction with the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2017 |
|---|---|
| Country/Territory | United States |
| City | Denver |
| Period | 11/12/17 → 11/17/17 |
Funding
CyberShake workflow research was supported by the National Science Foundation (NSF) under the OAC SI2-SSI grant #1148493, the OAC SI2-SSI grant #1450451, and EAR grant #1226343. This research was supported by the Southern California Earthquake Center (Contribution No. 7610). SCEC is funded by NSF Cooperative Agreement EAR-1033462 & USGS Cooperative Agreement G12AC20038. This work was funded by DOE contract number #DESC0012636, “Panorama – Predictive Modeling and Diagnostic Monitoring of Extreme Science Workflows”, and by NSF contract number #1664162, “ SI2-SSI: Pegasus: Automating Compute and Data Intensive Science”. This research used resources of the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231.
Keywords
- Burst Buffers
- High-Performance Computing
- In Transit Processing
- Scientific Workflows
Fingerprint
Dive into the research topics of 'On the use of burst buffers for accelerating data-intensive scientific work flows'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver