TY - GEN
T1 - Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems
AU - Dongarra, Jack J.
AU - Jeannot, Emmanuel
AU - Saule, Erik
AU - Shi, Zhiao
PY - 2007
Y1 - 2007
N2 - We tackle the problem of scheduling task graphs onto a heterogeneous set of machines, where each processor has a probability of failure governed by an exponential law. The goal is to design algorithms that optimize both makespan and reliability. First, we provide an optimal scheduling algorithm for independent unitary tasks where the objective is to maximize the reliability subject to makespan minimization. For the bi-criteria case, we provide an algorithm that approximates the Pareto-curve. Next, for independent non-unitary tasks, we show that the product {failure rate} × {unitary instruction execution time} is crucial to distinguish processors in this context. Based on these results we are able to let the user choose a trade-off between reliability maximization and makespan minimization. For general task graphs we provide a method for converting scheduling heuristics on heterogeneous cluster into heuristics that take reliability into account. Here again, we show how we can help the user to select a trade-off between makespan and reliability.
AB - We tackle the problem of scheduling task graphs onto a heterogeneous set of machines, where each processor has a probability of failure governed by an exponential law. The goal is to design algorithms that optimize both makespan and reliability. First, we provide an optimal scheduling algorithm for independent unitary tasks where the objective is to maximize the reliability subject to makespan minimization. For the bi-criteria case, we provide an algorithm that approximates the Pareto-curve. Next, for independent non-unitary tasks, we show that the product {failure rate} × {unitary instruction execution time} is crucial to distinguish processors in this context. Based on these results we are able to let the user choose a trade-off between reliability maximization and makespan minimization. For general task graphs we provide a method for converting scheduling heuristics on heterogeneous cluster into heuristics that take reliability into account. Here again, we show how we can help the user to select a trade-off between makespan and reliability.
KW - DAG
KW - Pareto-curve
KW - Reliability
KW - Scheduling
UR - http://www.scopus.com/inward/record.url?scp=35248884762&partnerID=8YFLogxK
U2 - 10.1145/1248377.1248423
DO - 10.1145/1248377.1248423
M3 - Conference contribution
AN - SCOPUS:35248884762
SN - 159593667X
SN - 9781595936677
T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures
SP - 280
EP - 288
BT - SPAA'07
T2 - SPAA'07: 19th Annual Symposium on Parallelism in Algorithms and Architectures
Y2 - 9 June 2007 through 11 June 2007
ER -