Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems

Fengguang Song, Asim Yarkhan, Jack Dongarra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

68 Scopus citations

Abstract

This paper presents a dynamic task scheduling approach to executing dense linear algebra algorithms on multicore systems (either shared-memory or distributed-memory). We use a task-based library to replace the existing linear algebra subroutines such as PBLAS to transparently provide the same interface and computational function as the ScaLAPACK library. Linear algebra programs are written with the task-based library and executed by a dynamic runtime system. We mainly focus our runtime system design on the metric of performance scalability. We propose a distributed algorithm to solve data dependences without process cooperation. We have implemented the runtime system and applied it to three linear algebra algorithms: Cholesky, LU, and QR factorizations. Our experiments on both shared-memory machines (16, 32 cores) and distributed-memory machines (1024 cores) demonstrate that our runtime system is able to achieve good scalability. Furthermore, we provide analytical analysis to show why the tiled algorithms are scalable and the expected execution time.

Original languageEnglish
Title of host publicationProceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09
DOIs
StatePublished - 2009
Externally publishedYes
EventConference on High Performance Computing Networking, Storage and Analysis, SC '09 - Portland, OR, United States
Duration: Nov 14 2009Nov 20 2009

Publication series

NameProceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09

Conference

ConferenceConference on High Performance Computing Networking, Storage and Analysis, SC '09
Country/TerritoryUnited States
CityPortland, OR
Period11/14/0911/20/09

Fingerprint

Dive into the research topics of 'Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems'. Together they form a unique fingerprint.

Cite this