Abstract
As computing systems approach the limits of traditional silicon technology, the diminishing returns in performance per watt present a significant barrier to sustaining growth in HPC. From a large-scale scientific supercomputing facility point of view, we propose a multifaceted strategy toward specialized hardware and architectures that are optimized for energy efficiency in specific applications. We also emphasize the need for integrating energy-aware practices across all levels of HPC, from system design and software development to operational policies. We discuss strategic opportunities such as the adoption of application-specific accelerators, the development of energy-efficient algorithms, and the implementation of data-driven operational analytics. Our goal is to develop a comprehensive roadmap ensuring that future leadership systems at OLCF can meet scientific demands while operating within stringent energy budgets, thereby supporting sustainable computing growth.
Original language | English |
---|---|
Title of host publication | Proceedings of SC 2024-W |
Subtitle of host publication | Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1790-1794 |
Number of pages | 5 |
ISBN (Electronic) | 9798350355543 |
DOIs | |
State | Published - 2024 |
Event | 2024 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC Workshops 2024 - Atlanta, United States Duration: Nov 17 2024 → Nov 22 2024 |
Publication series
Name | Proceedings of SC 2024-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis |
---|
Conference
Conference | 2024 Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC Workshops 2024 |
---|---|
Country/Territory | United States |
City | Atlanta |
Period | 11/17/24 → 11/22/24 |
Funding
This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. We thank Phil Roth, Olga Kuchar, Valentine Anantharaj, Frederic Suter, Wes Brewer, David Rogers, Patrick Widener, Renan Santos Souza, Aditya Kashi, Meghan Berry, Joshua Brown, Blake Nedved, Tatiyanna Singleton, Naw Safrin Sattar, Dan Dietz, Ana Gainaru, Rajni Chahal, Siyan Liu, Eva Zarkadoula, Norbert Podhorszki, Scott Klasky, Takuya Kurihana, Dalton Lunga, H. Casanova, Greg Eisenhauer, Gabriele Padovani, Sandro Fioreand, and Shantenu Jha for insightful discussions.
Keywords
- HPC Energy Efficiency
- Post-Exascale Supercomputing