Abstract
Large Language Models (LLMs) and other foundation models are increasingly used as the core of AI agents. In agentic workflows, these agents plan tasks, interact with humans and peers, and influence scientific outcomes across federated and heterogeneous environments. However, agents can hallucinate or reason incorrectly, propagating errors when one agent's output becomes another's input. Thus, assuring that agents' actions are transparent, traceable, reproducible, and reliable is critical to assess hallucination risks and mitigate their workflow impacts. While provenance techniques have long supported these principles, existing methods fail to capture and relate agent-centric metadata such as prompts, responses, and decisions with the broader workflow context and downstream outcomes. In this paper, we introduce PROV-AGENT, a provenance model that extends W3C PROV and leverages the Model Context Protocol (MCP) and data observability to integrate agent interactions into end-to-end workflow provenance. Our contributions include: (1) a provenance model tailored for agentic workflows, (2) a near real-time, open-source system for capturing agentic provenance, and (3) a cross-facility evaluation spanning edge, cloud, and HPC environments, demonstrating support for critical provenance queries and agent reliability analysis.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE International Conference on e-Science, eScience 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 467-473 |
| Number of pages | 7 |
| ISBN (Electronic) | 9798331591458 |
| DOIs | |
| State | Published - 2025 |
| Event | 21st IEEE International Conference on e-Science, eScience 2025 - Chicago, United States Duration: Sep 15 2025 → Sep 18 2025 |
Publication series
| Name | Proceedings - 2025 IEEE International Conference on e-Science, eScience 2025 |
|---|
Conference
| Conference | 21st IEEE International Conference on e-Science, eScience 2025 |
|---|---|
| Country/Territory | United States |
| City | Chicago |
| Period | 09/15/25 → 09/18/25 |
Funding
The authors thank the ORNL team: Miaosen Chai, Timothy Poteet, Phillipe Austria, Marshall McDonnell, Ross Miller, A.J. Ruckman, Tyler Skluzacek, Feiyi Wang, Sarp Oral, Arjun Shankar for their help with the use case development. ChatGPT-4o was used to help polish writing, improve conciseness, and check grammar. This research used resources of the Oak Ridge Leadership Computing Facility at Oak Ridge National Laboratory, supported by the Office of Science, U.S. Department of Energy (DOE) under Contract No. DE-AC05-00OR22725. Additional DOE support was provided under Contract No. DE-AC02-06CH11357.
Keywords
- Agentic Workflows
- LLM
- Lineage
- Provenance
- Responsible AI
- Workflows