Abstract
Modern scientific workflows require hybrid infrastructures combining numerous decentralized resources on the IoT/Edge interconnected to Cloud/HPC systems (aka the Computing Continuum) to enable their optimized execution. Understanding and optimizing the performance of such complex Edge-to-Cloud workflows is challenging. Capturing the provenance of key performance indicators, with their related data and processes, may assist in understanding and optimizing workflow executions. However, the capture overhead can be prohibitive, particularly in resource-constrained devices, such as the ones on the IoT/Edge.To address this challenge, based on a performance analysis of existing systems, we propose ProvLight, a tool to enable efficient provenance capture on the IoT/Edge. We leverage simplified data models, data compression and grouping, and lightweight transmission protocols to reduce overheads. We further integrate ProvLight into the E2Clab framework to enable workflow provenance capture across the Edge-to-Cloud Continuum. This integration makes E2Clab a promising platform for the performance optimization of applications through reproducible experiments.We validate ProvLight at a large scale with synthetic workloads on 64 real-life IoT/Edge devices in the FIT IoT LAB testbed. Evaluations show that ProvLight outperforms state-of-the-art systems like ProvLake and DfAnalyzer in resource-constrained devices. ProvLight is 26 - 37x faster to capture and transmit provenance data; uses 5 - 7x less CPU; 2x less memory; transmits 2x less data; and consumes 2 - 2.5x less energy. ProvLight [1] and E2Clab [2] are available as open-source tools.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2023 IEEE International Conference on Cluster Computing, CLUSTER 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 221-233 |
| Number of pages | 13 |
| ISBN (Electronic) | 9798350307924 |
| DOIs | |
| State | Published - 2023 |
| Externally published | Yes |
| Event | 25th IEEE International Conference on Cluster Computing, CLUSTER 2023 - Santa Fe, United States Duration: Oct 31 2023 → Nov 3 2023 |
Publication series
| Name | Proceedings - IEEE International Conference on Cluster Computing, ICCC |
|---|---|
| ISSN (Print) | 1552-5244 |
Conference
| Conference | 25th IEEE International Conference on Cluster Computing, CLUSTER 2023 |
|---|---|
| Country/Territory | United States |
| City | Santa Fe |
| Period | 10/31/23 → 11/3/23 |
Funding
ACKNOWLEDGMENTS This work was funded by Inria through the HPC-BigData Inria Challenge (IPL), by the French ANR OverFlow project (ANR-15-CE25-0003), and the HPDaSc associate team with Brazil. Marta Mattoso and Débora Pina are funded by CNPq and FAPERJ. Renan is at the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. Experiments presented in this paper were carried out using the Grid’5000 and FIT IoT LAB testbeds, supported by a scientific interest group hosted by several Universities and organizations.
Keywords
- Computing Continuum
- Edge
- IoT
- Lineage
- Provenance
- Workflows