KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments

  • Daniel Rosendo
  • , Kate Keahey
  • , Alexandru Costan
  • , Matthieu Simonin
  • , Patrick Valduriez
  • , Gabriel Antoniu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

Distributed infrastructures for computation and analytics are now evolving towards an interconnected ecosystem allowing complex scientific workflows to be executed across hybrid systems spanning from IoT Edge devices to Clouds, and sometimes to supercomputers (the Computing Continuum). Understanding the performance trade-offs of large-scale workflows deployed on such complex Edge-to-Cloud Continuum is challenging. To achieve this, one needs to systematically perform experiments, to enable their reproducibility and allow other researchers to replicate the study and the obtained conclusions on different infrastructures. This breaks down to the tedious process of reconciling the numerous experimental requirements and constraints with low-level infrastructure design choices. To address the limitations of the main state-of-the-art approaches for distributed, collaborative experimentation, such as Google Colab, Kaggle, and Code Ocean, we propose KheOps, a collaborative environment specifically designed to enable cost-effective reproducibility and replicability of Edge-to-Cloud experiments. KheOps is composed of three core elements: (1) an experiment repository; (2) a notebook environment; and (3) a multi-platform experiment methodology. We illustrate KheOps with a real-life Edge-to-Cloud application. The evaluations explore the point of view of the authors of an experiment described in an article (who aim to make their experiments reproducible) and the perspective of their readers (who aim to replicate the experiment). The results show how KheOps helps authors to systematically perform repeatable and reproducible experiments on the Grid5000 + FIT IoT LAB testbeds. Furthermore, KheOps helps readers to cost-effectively replicate authors experiments in different infrastructures such as Chameleon Cloud + CHI@Edge testbeds, and obtain the same conclusions with high accuracies (> 88% for all performance metrics).

Original languageEnglish
Title of host publicationProceedings of the 1st ACM Conference on Reproducibility and Replicability, REP 2023
PublisherAssociation for Computing Machinery, Inc
Pages62-73
Number of pages12
ISBN (Electronic)9798400701764
DOIs
StatePublished - Jun 27 2023
Externally publishedYes
Event1st ACM Conference on Reproducibility and Replicability, REP 2023 - Santa Cruz, United States
Duration: Jun 27 2023Jun 29 2023

Publication series

NameProceedings of the 1st ACM Conference on Reproducibility and Replicability, REP 2023

Conference

Conference1st ACM Conference on Reproducibility and Replicability, REP 2023
Country/TerritoryUnited States
CitySanta Cruz
Period06/27/2306/29/23

Funding

This work was funded by Inria through the HPC-BigData Inria Challenge (IPL) and through the UNIFY Associate Team joint in the framework of the JLESC international lab and the HPDeSc associate team with Brazil. It was co-funded by the French ANR OverFlow project (ANR-15-CE25-0003). Experiments presented in this paper were carried out using the Chameleon Cloud, CHI@Edge, Grid’5000, and FIT IoT LAB testbeds, supported by a scientific interest group hosted by several Universities. We also would like to thank Argonne National Laboratory for supporting this work. This material is based upon work supported by the U.S. Department of Energy, Office of Science, under contract number DE-AC02-06CH11357 as well as by the NSF award 2130889 and NIFA award 2021-67021-33775. This work was funded by Inria through the HPC-BigData Inria Challenge (IPL) and through the UNIFY Associate Team joint in the framework of the JLESC international lab and the HPDeSc associate team with Brazil. It was co-funded by the French ANR OverFlow project (ANR-15- CE25-0003). Experiments presented in this paper were carried out using the Chameleon Cloud, CHI@Edge, Grid'5000, and FIT IoT LAB testbeds, supported by a scientific interest group hosted by several Universities.We also would like to thank Argonne National Laboratory for supporting thiswork. This material is based upon work supported by the U.S. Department of Energy, Office of Science, under contract number DE-AC02-06CH11357 as well as by the NSF award 2130889 and NIFA award 2021-67021-33775.

Keywords

  • Cloud Computing
  • Computing Continuum
  • Edge Computing
  • Repeatability
  • Replicability
  • Reproducibility
  • Workflows

Fingerprint

Dive into the research topics of 'KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments'. Together they form a unique fingerprint.

Cite this