Balanced task clustering in scientific workflows

Weiwei Chen, Rafael Ferreira Da Silva, Ewa Deelman, Rizos Sakellariou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

39 Scopus citations

Abstract

Scientific workflows can be composed of many fine computational granularity tasks. The runtime of these tasks may be shorter than the duration of system overheads, for example, when using multiple resources of a cloud infrastructure. Task clustering is a runtime optimization technique that merges multiple short tasks into a single job such that the scheduling overhead is reduced and the overall runtime performance is improved. However, existing task clustering strategies only provide a coarse-grained approach that relies on an over-simplified workflow model. In our work, we examine the reasons that cause Runtime Imbalance and Dependency Imbalance in task clustering. Next, we propose quantitative metrics to evaluate the severity of the two imbalance problems respectively. Furthermore, we propose a series of task balancing methods to address these imbalance problems. Finally, we analyze their relationship with the performance of these task balancing methods. A trace-based simulation shows our methods can significantly improve the runtime performance of two widely used workflows compared to the actual implementation of task clustering.

Original languageEnglish
Title of host publicationProceedings - IEEE 9th International Conference on e-Science, e-Science 2013
PublisherIEEE Computer Society
Pages188-195
Number of pages8
ISBN (Print)9780768550831
DOIs
StatePublished - 2013
Externally publishedYes
Event9th IEEE International Conference on e-Science, e-Science 2013 - Beijing, China
Duration: Oct 22 2013Oct 25 2013

Publication series

NameProceedings - IEEE 9th International Conference on e-Science, e-Science 2013

Conference

Conference9th IEEE International Conference on e-Science, e-Science 2013
Country/TerritoryChina
CityBeijing
Period10/22/1310/25/13

Keywords

  • Data locality
  • Load balance
  • Scientific workflow
  • Task clustering

Fingerprint

Dive into the research topics of 'Balanced task clustering in scientific workflows'. Together they form a unique fingerprint.

Cite this