TY - GEN
T1 - Minimizing stretch and makespan of multiple Parallel Task Graphs via malleable allocations
AU - Casanova, Henri
AU - Desprez, Frédéric
AU - Suter, Frédéric
PY - 2010
Y1 - 2010
N2 - Many scientific applications can be structured as Parallel Task Graphs (PTGs), i.e., graphs of data-parallel tasks. Adding data-parallelism to a task-parallel application provides opportunities for higher performance and scalability, but poses scheduling challenges. We study the off-line scheduling of multiple PTGs on a single, homogeneous cluster. The objective is to optimize per-formance and fairness. We propose a novel algorithm that first computes perfectly fair PTG completion times assuming that each PTG is an ideal malleable job. These completion times are then relaxed so that the schedule is organized as a sequence of periods and is still close to the perfectly fair schedule. Finally, since PTGs are not perfectly malleable, the algorithm increases the execution time of all PTGs uniformly until it can successfully schedule each task in a period. Our evaluation in simulation, using both synthetic and real-world application configurations, shows that our algorithm outperforms previously proposed algorithms when considering two different performance metrics and one fairness metric.
AB - Many scientific applications can be structured as Parallel Task Graphs (PTGs), i.e., graphs of data-parallel tasks. Adding data-parallelism to a task-parallel application provides opportunities for higher performance and scalability, but poses scheduling challenges. We study the off-line scheduling of multiple PTGs on a single, homogeneous cluster. The objective is to optimize per-formance and fairness. We propose a novel algorithm that first computes perfectly fair PTG completion times assuming that each PTG is an ideal malleable job. These completion times are then relaxed so that the schedule is organized as a sequence of periods and is still close to the perfectly fair schedule. Finally, since PTGs are not perfectly malleable, the algorithm increases the execution time of all PTGs uniformly until it can successfully schedule each task in a period. Our evaluation in simulation, using both synthetic and real-world application configurations, shows that our algorithm outperforms previously proposed algorithms when considering two different performance metrics and one fairness metric.
UR - http://www.scopus.com/inward/record.url?scp=78649622919&partnerID=8YFLogxK
U2 - 10.1109/ICPP.2010.16
DO - 10.1109/ICPP.2010.16
M3 - Conference contribution
AN - SCOPUS:78649622919
SN - 9780769541563
T3 - Proceedings of the International Conference on Parallel Processing
SP - 71
EP - 80
BT - Proceedings - 39th International Conference on Parallel Processing, ICPP 2010
T2 - 39th International Conference on Parallel Processing, ICPP 2010
Y2 - 13 September 2010 through 16 September 2010
ER -