Abstract
Efficiently porting ordinary applications to Blue Gene/Q supercomputers is a significant challenge. Codes are often originally developed without considering advanced architectures and related tool chains. Science needs frequently lead users to want to run large numbers of relatively small jobs (often called many-task computing, an ensemble, or a workflow), which can conflict with supercomputer configurations. In this paper, we discuss techniques developed to execute ordinary applications over leadership class supercomputers. We use the high-performance Swift parallel scripting framework and build two workflow execution techniques - sub-jobs and main-wrap. The sub-jobs technique, built on top of the IBM Blue Gene/Q resource manager Cobalt's sub-block jobs, lets users submit multiple, independent, repeated smaller jobs within a single larger resource block. The main-wrap technique is a scheme that enables C/C++ programs to be defined as functions that are wrapped by a high-performance Swift wrapper and that are invoked as a Swift script. We discuss the needs, benefits, technicalities, and current limitations of these techniques. We further discuss the real-world science enabled by these techniques and the results obtained.
Original language | English |
---|---|
Title of host publication | Proceedings - 11th IEEE International Conference on eScience, eScience 2015 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 420-428 |
Number of pages | 9 |
ISBN (Electronic) | 9781467393256 |
DOIs | |
State | Published - Oct 22 2015 |
Externally published | Yes |
Event | 11th IEEE International Conference on eScience, eScience 2015 - Munich, Germany Duration: Aug 31 2015 → Sep 4 2015 |
Publication series
Name | Proceedings - 11th IEEE International Conference on eScience, eScience 2015 |
---|
Conference
Conference | 11th IEEE International Conference on eScience, eScience 2015 |
---|---|
Country/Territory | Germany |
City | Munich |
Period | 08/31/15 → 09/4/15 |
Funding
This work was supported in part by the U.S. Dept. of Energy, Office of Science, Office of Advanced Scientific Computing Research, under Contract DE-AC02-06CH11357. We thank Kevin N. Harms and Raymond M. Loy of ALCF for help with sub-block jobs and application installation. Work by Katz was supported by the National Science Foundation while working at the Foundation. Any opinion, finding, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
Keywords
- BG/Q
- Supercomputers
- Swift