TY - GEN
T1 - Scalability Issues in FFT Computation
AU - Ayala, Alan
AU - Tomov, Stanimire
AU - Stoyanov, Miroslav
AU - Dongarra, Jack
N1 - Publisher Copyright: © 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - The fast Fourier transform (FFT) is one of the most important tools in mathematics, and it is widely required by several applications of science and engineering. State-of-the-art parallel implementations of the FFT algorithm, based on Cooley-Tukey developments, are known to be communication-bound, which causes critical issues when scaling the computational and architectural capabilities. In this paper, we study the main performance bottleneck of FFT computations on hybrid CPU and GPU systems at large scale. We provide numerical simulations and potential acceleration techniques that can be easily integrated into distributed FFT libraries. We present different experiments on performance scalability and runtime analysis on the world’s most powerful supercomputers today: Summit, using up to 6,144 NVIDIA V100 GPUs, and Fugaku, using more than one million Fujitsu A64FX cores.
AB - The fast Fourier transform (FFT) is one of the most important tools in mathematics, and it is widely required by several applications of science and engineering. State-of-the-art parallel implementations of the FFT algorithm, based on Cooley-Tukey developments, are known to be communication-bound, which causes critical issues when scaling the computational and architectural capabilities. In this paper, we study the main performance bottleneck of FFT computations on hybrid CPU and GPU systems at large scale. We provide numerical simulations and potential acceleration techniques that can be easily integrated into distributed FFT libraries. We present different experiments on performance scalability and runtime analysis on the world’s most powerful supercomputers today: Summit, using up to 6,144 NVIDIA V100 GPUs, and Fugaku, using more than one million Fujitsu A64FX cores.
KW - Hybrid systems
KW - Parallel FFT
KW - Scalability
UR - http://www.scopus.com/inward/record.url?scp=85115336216&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-86359-3_21
DO - 10.1007/978-3-030-86359-3_21
M3 - Conference contribution
AN - SCOPUS:85115336216
SN - 9783030863586
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 279
EP - 287
BT - Parallel Computing Technologies - 16th International Conference, PaCT 2021, Proceedings
A2 - Malyshkin, Victor
PB - Springer Science and Business Media Deutschland GmbH
T2 - 16th International Conference on Parallel Computing Technologies, PaCT 2021
Y2 - 13 September 2021 through 18 September 2021
ER -