TY - GEN
T1 - The tradeoffs of fused memory hierarchies in heterogeneous computing architectures
AU - Spafford, Kyle L.
AU - Meredith, Jeremy S.
AU - Lee, Seyong
AU - Li, Dong
AU - Roth, Philip C.
AU - Vetter, Jeffrey S.
PY - 2012
Y1 - 2012
N2 - With the rise of general purpose computing on graphics processing units (GPGPU), the influence from consumer markets can now be seen across the spectrum of computer architectures. In fact, many of the high-ranking Top500 HPC systems now include these accelerators. Traditionally, GPUs have connected to the CPU via the PCIe bus, which has proved to be a significant bottleneck for scalable scientific applications. Now, a trend toward tighter integration between CPU and GPU has removed this bottleneck and unified the memory hierarchy for both CPU and GPU cores. We examine the impact of this trend for high performance scientific computing by investigating AMD's new Fusion Accelerated Processing Unit (APU) as a testbed. In particular, we evaluate the tradeoffs in performance, power consumption, and programmability when comparing this unified memory hierarchy with similar, but discrete GPUs.
AB - With the rise of general purpose computing on graphics processing units (GPGPU), the influence from consumer markets can now be seen across the spectrum of computer architectures. In fact, many of the high-ranking Top500 HPC systems now include these accelerators. Traditionally, GPUs have connected to the CPU via the PCIe bus, which has proved to be a significant bottleneck for scalable scientific applications. Now, a trend toward tighter integration between CPU and GPU has removed this bottleneck and unified the memory hierarchy for both CPU and GPU cores. We examine the impact of this trend for high performance scientific computing by investigating AMD's new Fusion Accelerated Processing Unit (APU) as a testbed. In particular, we evaluate the tradeoffs in performance, power consumption, and programmability when comparing this unified memory hierarchy with similar, but discrete GPUs.
KW - apu
KW - gpgpu
KW - heterogeneous
KW - performance analysis
UR - https://www.scopus.com/pages/publications/84862695013
U2 - 10.1145/2212908.2212924
DO - 10.1145/2212908.2212924
M3 - Conference contribution
AN - SCOPUS:84862695013
SN - 9781450312158
T3 - CF '12 - Proceedings of the ACM Computing Frontiers Conference
SP - 103
EP - 112
BT - CF '12 - Proceedings of the ACM Computing Frontiers Conference
T2 - ACM Computing Frontiers Conference, CF '12
Y2 - 15 May 2012 through 17 May 2012
ER -