Targeting atmospheric simulation algorithms for large, distributed-memory, GPU-accelerated computers

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

2 Scopus citations

Abstract

Computing platforms are increasingly moving to accelerated architectures; here we deal specifically with GPUs. In Norman et al. (2011), a method was developed for atmospheric simulation to improve efficiency on large, distributed-memory machines by reducing communication demand and increasing the time step. Here, we improve upon this method to further target GPU-accelerated platforms by reducing GPU memory accesses, removing a synchronization point, and clustering computations. The modified code ran more than twice as fast as the original in some cases, even though more computations were required, demonstrating the importance of improving memory handling on the GPU. Furthermore, we found that the modified code also achieves a nearly 100% hit rate in the fast, on-chip L1 cache, and we discuss the reasons for this. Finally, we remark on further potential improvements to GPU efficiency.

Original language: English
Title of host publication: Lecture Notes in Earth System Sciences
Publisher: Springer International Publishing
Pages: 271-282
Number of pages: 12
Edition: 9783642164040
DOIs
State: Published - 2013

Publication series

Name: Lecture Notes in Earth System Sciences
Number: 9783642164040
Volume: 0
ISSN (Print): 2193-8571
ISSN (Electronic): 2193-858X

Bibliographical note

Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2013.
