TY - GEN
T1 - Leveraging the performance of LBM-HPC for large sizes on GPUs using ghost cells
AU - Valero-Lara, Pedro
N1 - Publisher Copyright:
© Springer International Publishing AG 2016.
PY - 2016
Y1 - 2016
N2 - Today, we are living a growing demand of larger and more efficient computational resources from the scientific community. On the other hand, the appearance of GPUs for general purpose computing supposed an important advance for covering such demand. These devices offer an impressive computational capacity at low cost and an efficient power consumption. However, the memory available in these devices is (sometimes) not enough, and so it is necessary computationally expensive memory transfers from (to) CPU to (from) GPU, causing a dramatic fall in performance. Recently, the Lattice-Boltzmann Method has positioned as an efficient methodology for fluid simulations. Although this method presents some interesting features particularly amenable to be efficiently exploited on parallel computers, it requires a considerable memory capacity, which can suppose an important drawback, in particular, on GPUs. In the present paper, it is proposed a new GPU-based implementation, which minimizes such requirements with respect to other state-of-theart implementations. It allows us to execute almost 2× bigger problems without additional memory transfers, achieving faster executions when dealing with large problems.
AB - Today, we are living a growing demand of larger and more efficient computational resources from the scientific community. On the other hand, the appearance of GPUs for general purpose computing supposed an important advance for covering such demand. These devices offer an impressive computational capacity at low cost and an efficient power consumption. However, the memory available in these devices is (sometimes) not enough, and so it is necessary computationally expensive memory transfers from (to) CPU to (from) GPU, causing a dramatic fall in performance. Recently, the Lattice-Boltzmann Method has positioned as an efficient methodology for fluid simulations. Although this method presents some interesting features particularly amenable to be efficiently exploited on parallel computers, it requires a considerable memory capacity, which can suppose an important drawback, in particular, on GPUs. In the present paper, it is proposed a new GPU-based implementation, which minimizes such requirements with respect to other state-of-theart implementations. It allows us to execute almost 2× bigger problems without additional memory transfers, achieving faster executions when dealing with large problems.
KW - CUDA
KW - Computational fluid dynamics
KW - GPU
KW - Lattice-Boltzmann method
UR - http://www.scopus.com/inward/record.url?scp=85007153898&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-49583-5_31
DO - 10.1007/978-3-319-49583-5_31
M3 - Conference contribution
AN - SCOPUS:85007153898
SN - 9783319495828
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 417
EP - 430
BT - Algorithms and Architectures for Parallel Processing - 16th International Conference, ICA3PP 2016, Proceedings
A2 - Carretero, Jesus
A2 - Nakano, Koji
A2 - Ko, Ryan K.L.
A2 - Mueller, Peter
A2 - Garcia-Blas, Javier
PB - Springer Verlag
T2 - 16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016
Y2 - 14 December 2016 through 16 December 2016
ER -