Automated generation of scientific workflow generators with WfChef

Tainã Coleman, Henri Casanova, Rafael Ferreira da Silva

Research output: Contribution to journalArticlepeer-review

Abstract

Scientific workflow applications have gained significant importance, and their automated and efficient execution on large-scale computing platforms has been the subject of extensive research and development. For these efforts to be successful, a solid experimental methodology is needed to evaluate workflow algorithms and systems. A foundation for this methodology is the availability of realistic workflow instances. Although public repositories provide workflow instances for a few scientific applications, these are limited in scope, and workflow instances are not available for all application scales of interest. To address this limitation, previous work has developed generators of synthetic workflow instances of arbitrary scales. Despite being popular, the implementation of these generators is a manual and labor-intensive process that requires expert application knowledge. As a result, these generators only target a handful of applications, even though there are hundreds of workflow applications in production. We introduce WfChef, a fully automated framework for constructing a synthetic workflow generator for any scientific application. Based on an input set of workflow instances for a particular application, WfChef automatically produces a synthetic workflow generator. To measure the realism of the generated workflows, we define and evaluate several metrics. Using these metrics, we compare the realism of the workflows generated by WfChef generators to that of the workflows generated by the previously available, hand-crafted generators. We find that WfChef generators not only require zero development effort (because they are automatically produced), but also generate workflows that are more realistic than those generated by hand-crafted generators.

Original languageEnglish
Pages (from-to)16-29
Number of pages14
JournalFuture Generation Computer Systems
Volume147
DOIs
StatePublished - Oct 2023

Bibliographical note

Publisher Copyright:
© 2023 Elsevier B.V.

Keywords

  • Automatic workflow generation
  • Realistic synthetic workflow instance
  • Scientific workflows
  • Synthetic workflow generation
  • Workflow management systems
  • Workflow pattern detection

Fingerprint

Dive into the research topics of 'Automated generation of scientific workflow generators with WfChef'. Together they form a unique fingerprint.

Cite this