Abstract
As high-performance computing (HPC) is being scaled up to exascale to accommodate new modeling and simulation needs, I/O has continued to be a major bottleneck in the end-to-end scientific processes. Nevertheless, prior work in this area mostly aimed to maximize the average performance, and there has been a lack of study and solutions that can manage I/O performance variation on HPC systems. This work aims to take advantage of the storage characteristics and explore application level solutions that are interference-aware. In particular, we monitor the performance of data analytics and estimate the state of shared storage resources using discrete fourier transform (DFT). If heavy I/O interference is predicted to occur at a given timestep, data analytics can dynamically adapt to the environment by lowering the accuracy and performing partial or no augmentation from the shared storage, dictated by an augmentation-bandwidth plot. We evaluate three data analytics, XGC, GenASiS, and Jet, on Chameleon, and quantitatively demonstrate that both the average and variation of I/O performance can be vastly improved using our dynamic augmentation, with the mean and variance improved by as much as 67% and 96%, respectively, while maintaining acceptable outcome of data analysis.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of SC 2020 |
| Subtitle of host publication | International Conference for High Performance Computing, Networking, Storage and Analysis |
| Publisher | IEEE Computer Society |
| ISBN (Electronic) | 9781728199986 |
| DOIs | |
| State | Published - Nov 2020 |
| Event | 2020 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2020 - Virtual, Online, United States Duration: Nov 9 2020 → Nov 19 2020 |
Publication series
| Name | International Conference for High Performance Computing, Networking, Storage and Analysis, SC |
|---|---|
| Volume | 2020-November |
| ISSN (Print) | 2167-4329 |
| ISSN (Electronic) | 2167-4337 |
Conference
| Conference | 2020 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2020 |
|---|---|
| Country/Territory | United States |
| City | Virtual, Online |
| Period | 11/9/20 → 11/19/20 |
Funding
ACKNOWLEDGMENTS The authors wish to acknowledge the support from the US Department of Energy Exascale Computing Project (17-SC-20-SC), Chameleon Cloud, and US National Science Foundation CCF-1718297, CCF-1812861.
Keywords
- High performance computing
- data analysis
- data storage
Fingerprint
Dive into the research topics of 'Taming i/o variation on qos-less hpc storage: What can applications do?'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver