Abstract
As applications target extreme scales, data staging and in-situ/in-transit data processing have been proposed to address the data challenges and improve scientific discovery. However, further research is necessary in order to understand how growing data sizes from data intensive simulations coupled with the limited DRAM capacity in High End Computing systems will impact the effectiveness of this approach. In this paper, we explore how we can use deep memory levels for data staging, and develop a multi-tiered data staging method that spans bothDRAM and solid state disks (SSD). This approach allows us to support both code coupling and data management for data intensive simulation workflows. We also show how an adaptive application-aware data placement mechanism can dynamically manage and optimize data placement across the DRAM ands storage levels in this multi-tiered data staging method. We present an experimental evaluation of our approach using wolf resources: an Infiniband cluster (Sith) and a Cray XK7system (Titan), and using combustion (S3D) and fusion (XGC1) simulations.
Original language | English |
---|---|
Title of host publication | Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1033-1042 |
Number of pages | 10 |
ISBN (Electronic) | 9781479986484 |
DOIs | |
State | Published - Jul 17 2015 |
Event | 29th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015 - Hyderabad, India Duration: May 25 2015 → May 29 2015 |
Publication series
Name | Proceedings - 2015 IEEE 29th International Parallel and Distributed Processing Symposium, IPDPS 2015 |
---|
Conference
Conference | 29th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2015 |
---|---|
Country/Territory | India |
City | Hyderabad |
Period | 05/25/15 → 05/29/15 |
Funding
The research presented in this work is supported in part by the US National Science Foundation (NSF) via grants numbers ACI 1339036, ACI 1310283, DMS 1228203 and IIP 0758566, by the Director, Office of Advanced Scientific Computing Research, Office of Science, of the U.S. Department of Energy through the Scientific Discovery through Advanced Computing (SciDAC) Institute of Scalable Data Management, Analysis and Visualization (SDAV) under award number DE-SC0007455, the Advanced Scientific Computing Research and Fusion Energy Sciences Partnership for Edge Physics Simulations (EPSI) under award number DE-FG02-06ER54857, the ExaCT Combustion Co-Design Center via subcontract number 4000110839 from UT Battelle, and the RSVP grant via subcontract number 4000126989 from UT Battelle. The research was conducted as part of the NSF Cloud and Autonomic Computing (CAC) Center at Rutgers University and the Rutgers Discovery Informatics Institute (RDI2).
Keywords
- Coupling
- Data management
- adaptation
- data placement
- data staging
- deep memory hierarchy
- exascale
- multi-tiered
- ssd