Abstract
The Locally Self-consistent Multiple Scattering (LSMS) code solves the first principles Density Functional theory Kohn-Sham equation for a wide range of materials with a special focus on metals, alloys and metallic nano-structures. It has traditionally exhibited near perfect scalability on massively parallel high performance computer architectures. We present our efforts to exploit GPUs to accelerate the LSMS code to enable first principles calculations of O(100,000) atoms and statistical physics sampling of finite temperature properties. Using the Cray XK7 system Titan at the Oak Ridge Leadership Computing Facility we achieve a sustained performance of 14.5PFlop/s and a speedup of 8.6 compared to the CPU only code.
Original language | English |
---|---|
Title of host publication | Big Data Technology and Applications - 1st National Conference, BDTA 2015, Proceedings |
Editors | Gansen Zhao, Zeguang Lu, Wenguang Chen, Guisheng Yin, Qilong Han, Weipeng Jing, Guanglu Sun |
Publisher | Springer Verlag |
Pages | 259-268 |
Number of pages | 10 |
ISBN (Print) | 9789811004568 |
DOIs | |
State | Published - 2016 |
Event | 1st National Conference on Big Data Technology and Applications, BDTA 2015 - Harbin, China Duration: Dec 25 2015 → Dec 26 2015 |
Publication series
Name | Communications in Computer and Information Science |
---|---|
Volume | 590 |
ISSN (Print) | 1865-0929 |
Conference
Conference | 1st National Conference on Big Data Technology and Applications, BDTA 2015 |
---|---|
Country/Territory | China |
City | Harbin |
Period | 12/25/15 → 12/26/15 |
Funding
This work has been sponsored by the U.S. Department of Energy, Office of Science, Basic Energy Sciences, Material Sciences and Engineering Division (basic theory and applications) and by the Office of Advanced Scientific Computing (software optimization and performance measurements). This research used resources of the Oak Ridge Leadership Computing Facility, which is supported by the Office of Science of the U.S. Department of Energy under contract no. DE-AC05-00OR22725 .