A semantic-based approach to attain reproducibility of computational environments in scientific workflows: A case study

Idafen Santana-Perez, Rafael Ferreira da Silva, Mats Rynge, Ewa Deelman, María S. Pérez-Hernández, Oscar Corcho

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Reproducible research in scientific workflows is often addressed by tracking the provenance of the produced results. While this approach allows inspecting intermediate and final results, improves understanding, and permits replaying a workflow execution, it does not ensure that the computational environment is available for subsequent executions to reproduce the experiment. In this work, we propose describing the resources involved in the execution of an experiment using a set of semantic vocabularies, so as to conserve the computational environment. We define a process for documenting the workflow application, management system, and their dependencies based on 4 domain ontologies. We then conduct an experimental evaluation using a real workflow application on an academic and a public Cloud platform. Results show that our approach can reproduce an equivalent execution environment of a predefined virtual machine image on both computing platforms.

Original language: English
Pages (from-to): 452-463
Number of pages: 12
Journal: Lecture Notes in Computer Science
Volume: 8805
State: Published - 2014
Externally published: Yes

Funding

Funder                         Funder number
Indiana University
National Science Foundation    0910812
