The fat-link computation on large GPU clusters for lattice QCD

Guochun Shi, Ronald Babich, Michael A. Clark, Bálint Joó, Steven Gottlieb, Volodymyr Kindratenko

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Graphics Processing Units (GPUs) are becoming increasingly popular in high-performance computing due to their high performance, high power efficiency, and low cost. In this paper, we present results of an effort to implement the fat-link computation, an important component of many lattice quantum chromodynamics (LQCD) calculations, on GPU clusters using the QUDA framework. Two implementations are presented and their relative advantages discussed: one similar to the original CPU algorithm in the MILC code, and one that reduces communication by performing redundant computations. In strong-scaling tests on up to 384 GPUs on the Longhorn cluster and 256 GPUs on the Keeneland cluster, both of which have a CPU-core-to-GPU ratio of 4:1, we achieved node speedups of up to 11.4x and 8.7x, respectively.
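
The idea of trading redundant computation for reduced communication can be illustrated with a toy example. The sketch below is not the fat-link algorithm and uses neither QUDA nor MILC; it assumes a one-dimensional periodic field, a simple three-point sum as the stencil, and "ranks" simulated as slices of a Python list. All function names are hypothetical. The baseline exchanges a one-site halo before every sweep, while the alternative fetches a halo as wide as the number of sweeps once and then recomputes the halo sites locally.

```python
def step(values):
    """One 3-point stencil sweep; the output is 2 sites shorter than the input."""
    return [values[i - 1] + values[i] + values[i + 1]
            for i in range(1, len(values) - 1)]


def reference(field, sweeps):
    """Apply the stencil `sweeps` times to the whole periodic field."""
    n = len(field)
    for _ in range(sweeps):
        field = [field[(i - 1) % n] + field[i] + field[(i + 1) % n]
                 for i in range(n)]
    return field


def with_halo(field, start, size, width):
    """Copy one subdomain plus `width` neighbor sites per side (one exchange)."""
    n = len(field)
    return [field[(start + i) % n] for i in range(-width, size + width)]


def exchange_every_sweep(field, parts, sweeps):
    """Baseline: a width-1 halo exchange before every sweep."""
    size = len(field) // parts  # assumes len(field) is divisible by parts
    rounds = 0
    for _ in range(sweeps):
        new_field = []
        for p in range(parts):
            new_field += step(with_halo(field, p * size, size, 1))
        field = new_field
        rounds += 1
    return field, rounds


def exchange_once(field, parts, sweeps):
    """Redundant computation: one wide exchange, then purely local sweeps."""
    size = len(field) // parts  # assumes len(field) is divisible by parts
    result = []
    for p in range(parts):
        local = with_halo(field, p * size, size, sweeps)
        for _ in range(sweeps):
            local = step(local)  # halo sites are recomputed on every rank
        result += local
    return result, 1


field = [(7 * i) % 11 for i in range(16)]
a, rounds_a = exchange_every_sweep(field, parts=4, sweeps=3)
b, rounds_b = exchange_once(field, parts=4, sweeps=3)
assert a == b == reference(field, 3)
print(rounds_a, rounds_b)  # 3 exchange rounds versus 1
```

Both variants produce the same field; the second performs extra arithmetic on the halo sites in exchange for fewer, larger messages. Whether that trade pays off depends on the relative cost of computation and communication on a given machine.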

Original language: English
Title of host publication: Proceedings - 2012 Symposium on Application Accelerators in High Performance Computing, SAAHPC 2012
Pages: 1-10
Number of pages: 10
State: Published - 2012
Externally published: Yes
Event: 2012 Symposium on Application Accelerators in High Performance Computing, SAAHPC 2012 - Argonne, IL, United States
Duration: Jul 10 2012 → Jul 11 2012

Publication series

Name: Symposium on Application Accelerators in High-Performance Computing
ISSN (Print): 2166-5133
ISSN (Electronic): 2166-515X

Conference

Conference: 2012 Symposium on Application Accelerators in High Performance Computing, SAAHPC 2012
Country/Territory: United States
City: Argonne, IL
Period: 07/10/12 → 07/11/12

Keywords

  • CUDA
  • GPU
  • Lattice QCD
  • MILC
  • QUDA
  • Quantum chromodynamics
