An implementation of the tile QR factorization for a GPU and multiple CPUs

Jakub Kurzak, Rajib Nath, Peng Du, Jack Dongarra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

9 Scopus citations

Abstract

The tile QR factorization provides an efficient and scalable way for factoring a dense matrix in parallel on multicore processors. This article presents a way of efficiently implementing the algorithm on a system with a powerful GPU and many multicore CPUs.

Original languageEnglish
Title of host publicationApplied Parallel and Scientific Computing - 10th International Conference, PARA 2010, Revised Selected Papers
Pages248-257
Number of pages10
EditionPART 2
DOIs
StatePublished - 2012
Event10th International Conference on Applied Parallel and Scientific Computing, PARA 2010 - Reykjavik, Iceland
Duration: Jun 6 2010Jun 9 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7134 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Conference on Applied Parallel and Scientific Computing, PARA 2010
Country/TerritoryIceland
CityReykjavik
Period06/6/1006/9/10

Fingerprint

Dive into the research topics of 'An implementation of the tile QR factorization for a GPU and multiple CPUs'. Together they form a unique fingerprint.

Cite this