Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems

Jack J. Dongarra, Emmanuel Jeannot, Erik Saule, Zhiao Shi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

132 Scopus citations

Abstract

We tackle the problem of scheduling task graphs onto a heterogeneous set of machines, where each processor has a probability of failure governed by an exponential law. The goal is to design algorithms that optimize both makespan and reliability. First, we provide an optimal scheduling algorithm for independent unitary tasks where the objective is to maximize the reliability subject to makespan minimization. For the bi-criteria case, we provide an algorithm that approximates the Pareto-curve. Next, for independent non-unitary tasks, we show that the product {failure rate} × {unitary instruction execution time} is crucial to distinguish processors in this context. Based on these results we are able to let the user choose a trade-off between reliability maximization and makespan minimization. For general task graphs we provide a method for converting scheduling heuristics on heterogeneous cluster into heuristics that take reliability into account. Here again, we show how we can help the user to select a trade-off between makespan and reliability.

Original languageEnglish
Title of host publicationSPAA'07
Subtitle of host publicationProceedings of the Nineteenth Annual Symposium on Parallelism in Algorithms and Architectures
Pages280-288
Number of pages9
DOIs
StatePublished - 2007
Externally publishedYes
EventSPAA'07: 19th Annual Symposium on Parallelism in Algorithms and Architectures - San Diego, CA, United States
Duration: Jun 9 2007Jun 11 2007

Publication series

NameAnnual ACM Symposium on Parallelism in Algorithms and Architectures

Conference

ConferenceSPAA'07: 19th Annual Symposium on Parallelism in Algorithms and Architectures
Country/TerritoryUnited States
CitySan Diego, CA
Period06/9/0706/11/07

Keywords

  • DAG
  • Pareto-curve
  • Reliability
  • Scheduling

Fingerprint

Dive into the research topics of 'Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems'. Together they form a unique fingerprint.

Cite this