Abstract
Scientific workflows have become ubiquitous across scientific fields, and their execution methods and systems continue to be the subject of research and development. Most experimental evaluations of these workflows rely on workflow instances, which can be either real-world or synthetic, to ensure relevance to current application domains or explore hypothetical/future scenarios. The WfCommons project addresses this need by providing data and tools to support such evaluations. In this paper, we present an overview of WfCommons and describe two recent developments. Firstly, we introduce a workflow execution "tracer"for Nextflow, which significantly enhances the set of real-world instances available in WfCommons. Secondly, we describe a workflow instance "translator"that enables the execution of any real-world or synthetic WfCommons workflow instance using Dask. Our contributions aim to provide researchers and practitioners with more comprehensive resources for evaluating scientific workflows.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2023 IEEE 47th Annual Computers, Software, and Applications Conference, COMPSAC 2023 |
| Editors | Hossain Shahriar, Yuuichi Teranishi, Alfredo Cuzzocrea, Moushumi Sharmin, Dave Towey, AKM Jahangir Alam Majumder, Hiroki Kashiwazaki, Ji-Jiang Yang, Michiharu Takemoto, Nazmus Sakib, Ryohei Banno, Sheikh Iqbal Ahamed |
| Publisher | IEEE Computer Society |
| Pages | 1870-1875 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798350326970 |
| DOIs | |
| State | Published - 2023 |
| Event | 47th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2023 - Hybrid, Torino, Italy Duration: Jun 26 2023 → Jun 30 2023 |
Publication series
| Name | Proceedings - International Computer Software and Applications Conference |
|---|---|
| Volume | 2023-June |
| ISSN (Print) | 0730-3157 |
Conference
| Conference | 47th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2023 |
|---|---|
| Country/Territory | Italy |
| City | Hybrid, Torino |
| Period | 06/26/23 → 06/30/23 |
Funding
This manuscript has been authored in part by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a non-exclusive, paid up, irrevocable, world-wide license to publish or reproduce the published form of the manuscript, or allow others to do so, for U.S. Government purposes. The DOE will provide public access to these results in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). ACKNOWLEDGMENT This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. This research was partially supported by NSF Awards #2106059 and #2103489. Finally, we thank the NSF Chameleon Cloud for providing time grants to access their resources.
Keywords
- Scientific workflows
- workflow instance collection
- workflow instance execution