TY - GEN
T1 - Balanced task clustering in scientific workflows
AU - Chen, Weiwei
AU - Da Silva, Rafael Ferreira
AU - Deelman, Ewa
AU - Sakellariou, Rizos
PY - 2013
Y1 - 2013
N2 - Scientific workflows can be composed of many fine computational granularity tasks. The runtime of these tasks may be shorter than the duration of system overheads, for example, when using multiple resources of a cloud infrastructure. Task clustering is a runtime optimization technique that merges multiple short tasks into a single job such that the scheduling overhead is reduced and the overall runtime performance is improved. However, existing task clustering strategies only provide a coarse-grained approach that relies on an over-simplified workflow model. In our work, we examine the reasons that cause Runtime Imbalance and Dependency Imbalance in task clustering. Next, we propose quantitative metrics to evaluate the severity of the two imbalance problems respectively. Furthermore, we propose a series of task balancing methods to address these imbalance problems. Finally, we analyze their relationship with the performance of these task balancing methods. A trace-based simulation shows our methods can significantly improve the runtime performance of two widely used workflows compared to the actual implementation of task clustering.
AB - Scientific workflows can be composed of many fine computational granularity tasks. The runtime of these tasks may be shorter than the duration of system overheads, for example, when using multiple resources of a cloud infrastructure. Task clustering is a runtime optimization technique that merges multiple short tasks into a single job such that the scheduling overhead is reduced and the overall runtime performance is improved. However, existing task clustering strategies only provide a coarse-grained approach that relies on an over-simplified workflow model. In our work, we examine the reasons that cause Runtime Imbalance and Dependency Imbalance in task clustering. Next, we propose quantitative metrics to evaluate the severity of the two imbalance problems respectively. Furthermore, we propose a series of task balancing methods to address these imbalance problems. Finally, we analyze their relationship with the performance of these task balancing methods. A trace-based simulation shows our methods can significantly improve the runtime performance of two widely used workflows compared to the actual implementation of task clustering.
KW - Data locality
KW - Load balance
KW - Scientific workflow
KW - Task clustering
UR - http://www.scopus.com/inward/record.url?scp=84893465558&partnerID=8YFLogxK
U2 - 10.1109/eScience.2013.40
DO - 10.1109/eScience.2013.40
M3 - Conference contribution
AN - SCOPUS:84893465558
SN - 9780768550831
T3 - Proceedings - IEEE 9th International Conference on e-Science, e-Science 2013
SP - 188
EP - 195
BT - Proceedings - IEEE 9th International Conference on e-Science, e-Science 2013
PB - IEEE Computer Society
T2 - 9th IEEE International Conference on e-Science, e-Science 2013
Y2 - 22 October 2013 through 25 October 2013
ER -