Abstract
Numerical techniques used for describing many-body systems, such as the Coupled Cluster methods (CC) of the quantum chemistry package NWChem, are of extreme interest to the computational chemistry community in fields such as catalytic reactions, solar energy, and bio-mass conversion. In spite of their importance, many of these computationally intensive algorithms have traditionally been thought of in a fairly linear fashion, or are parallelized in coarse chunks. In this paper, we present our effort of converting the NWChem’s CC code into a dataflow-based form that is capable of utilizing the task scheduling system PaRSEC (Parallel Runtime Scheduling and Execution Controller): a software package designed to enable high-performance computing at scale. We discuss the modularity of our approach and explain how the PaRSEC-enabled dataflow version of the subroutines seamlessly integrate into the NWChem codebase. Furthermore, we argue how the CC algorithms can be easily decomposed into finer-grained tasks (compared with the original version of NWChem); and how data distribution and load balancing are decoupled and can be tuned independently. We demonstrate performance acceleration by more than a factor of two in the execution of the entire CC component of NWChem, concluding that the utilization of dataflow-based execution for CC methods enables more efficient and scalable computation.
Original language | English |
---|---|
Pages (from-to) | 540-551 |
Number of pages | 12 |
Journal | International Journal of High Performance Computing Applications |
Volume | 32 |
Issue number | 4 |
DOIs | |
State | Published - Jul 1 2018 |
Funding
We thank the anonymous reviewers for their improvement suggestions. A portion of this research was performed using EMSL, a DOE Office of Science User Facility sponsored by the Office of Biological and Environmental Research and located at Pacific Northwest National Laboratory. The authorr((ss))ddisicslcolsoesdedrerceecipetipotf othfethfoellfoowllionwg ifninganficniaalnscuipa-l spuoprtpofortr fthoer rtehseearrecshe,aaructhh,oarsuhtihpo,rashndip/o,ranpdub/olircaptiuobnliocafttihoins ofarticlthise:Thearticle:authorsThisdismaterialclosedreciseipbasedtoftheuponfollowworkingfinsup-an- ported in part by the Air Force Office of Scientific Research under AFOSR Award Number FA9550-12-1-0476, and the DOE Office of Science, Advanced Scien-AFOSR Award Number FA9550-12-1-0476, and the DOE tific Computing Research, under award number DE-Office of Science, Advanced Scientific Computing Research, SC0006733 “SUPER - Institute for Sustained Perfor-under award number DE-SC0006733 ‘‘SUPER - Institute for mance,SustainedEnergyPerformaandnceResilience”.,EnergyandResilience’’.
Keywords
- CCSD
- DAG
- NWChem
- PTG
- PaRSEC
- dataflow
- tasks