Tridiagonalization of a symmetric dense matrix on a GPU cluster

Ichitaro Yamazaki, Tingxing Dong, Stanimire Tomov, Jack Dongarra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Symmetric dense Eigen value problems arise in many scientific and engineering simulations. In this paper, we use GPUs to accelerate its main computational kernel, the tridiagonalization of a dense symmetric matrix on a distributed multicore architecture. We then study the performance of this hybrid message-passing/shared-memory/GPU-computing paradigm on up to 16 compute nodes, each of which consists of 16 Intel Sandy Bridge processors and three NVIDIA GPUs. These studies show that such a hybrid paradigm can exploit the underlying hardware architecture and obtain significant speedups over a flat message-passing paradigm can, and they demonstrate a potential of efficiently solving large-scale Eigen value problems on a GPU cluster. Furthermore, these studies may provide insights on the general effects of such hybrid paradigms on emerging high-performance computers.

Original languageEnglish
Title of host publicationProceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013
PublisherIEEE Computer Society
Pages1070-1079
Number of pages10
ISBN (Print)9780769549798
DOIs
StatePublished - 2013
Event2013 IEEE 37th Annual Computer Software and Applications Conference, COMPSAC 2013 - Boston, MA, Japan
Duration: Jul 22 2013Jul 26 2013

Publication series

NameProceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013

Conference

Conference2013 IEEE 37th Annual Computer Software and Applications Conference, COMPSAC 2013
Country/TerritoryJapan
CityBoston, MA
Period07/22/1307/26/13

Keywords

  • GPU cluster
  • dense symmetric tridiagonalization
  • distributed multicores
  • hybrid programming

Fingerprint

Dive into the research topics of 'Tridiagonalization of a symmetric dense matrix on a GPU cluster'. Together they form a unique fingerprint.

Cite this