Cholesky Factorization on Heterogeneous CPU and GPU Systems

Jieyang Chen, Zizhong Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

General-purpose graphics processing units (GPGPUs) could bring huge performance improvements in scientific and numerical fields. We presented two approaches utilizing hybrid CPU/GPU system in Cholesky factorization. First, we analyzed the implementation of Cholesky factorization in MAGMA and identified the bottleneck of the current implementation, which is the use of fixed block size without considering any factors in the computing environment. So, we designed an algorithm, which could determine the optimal block size of Cholesky factorization based on multiple factors (input matrix size, CPU/GPU performance, and CPU/GPU bandwidth, etc.). Then, we presented a new improvement on MAGMA's implementation utilize the algorithm. Test results showed that our approach is more efficient than MAGMA's fixed block size implementation under some circumstance. After combining our implementation with MAGMA's implementation, the new hybrid implementation could outperform the current MAGMA implementation. Second, we identified that all the implementations of Cholesky factorization, to our best knowledge, that utilized the GPU do not fully utilized the multicore CPU. So, after studied other researchers approaches, we designed a new algorithm that could utilize multicore CPU and GPU simultaneously in Cholesky factorization. Our approach could keep the block size and workload distribution between CPU and GPU dynamically. Testing results showed the optimal data distribution ratio for our current implementation.

Original languageEnglish
Title of host publicationProceedings - 2015 9th International Conference on Frontier of Computer Science and Technology, FCST 2015
EditorsXiaohua Jia, Yong Zhang, Tharam Dillion, Nei Kato, Yunquan Zhang, Kuan Ching Li, Kui Wu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages19-26
Number of pages8
ISBN (Electronic)9781467392952
DOIs
StatePublished - Oct 30 2015
Externally publishedYes
Event9th International Conference on Frontier of Computer Science and Technology, FCST 2015 - Dalian, China
Duration: Aug 26 2015Aug 28 2015

Publication series

NameProceedings - 2015 9th International Conference on Frontier of Computer Science and Technology, FCST 2015

Conference

Conference9th International Conference on Frontier of Computer Science and Technology, FCST 2015
Country/TerritoryChina
CityDalian
Period08/26/1508/28/15

Keywords

  • Cholesky Factorization
  • GPU
  • Multicore
  • Numerical algorithm
  • Parallel algorithm

Fingerprint

Dive into the research topics of 'Cholesky Factorization on Heterogeneous CPU and GPU Systems'. Together they form a unique fingerprint.

Cite this