Abstract
AI-driven scientific discovery has emerged as a transformative fifth paradigm in research, with agentic AI playing an increasingly prominent role across scientific domains. Agentic AI can enable collaborative AI-human or even fully autonomous decision-making, but it also introduces significant reliability challenges due to the dynamic and evolutionary nature of the AI agents. Specifically, foundation model-powered agents are prone to generating hallucinated, misleading, or adversarial outputs that can propagate silently through workflows and corrupt downstream results. In this paper we present a conceptual framework for a unified approach that integrates agentic workflow-level instrumentation and agent-level safeguards to enhance the reliability of the wider system, particularly critical in science. Embedding these mechanisms into a provenance-augmented infrastructure enables early detection, containment, and recovery from erroneous behavior, ultimately enhancing reliability and reproducibility in AI-assisted scientific workflows.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 IEEE International Conference on e-Science, eScience 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 415-426 |
| Number of pages | 12 |
| ISBN (Electronic) | 9798331591458 |
| DOIs | |
| State | Published - 2025 |
| Event | 21st IEEE International Conference on e-Science, eScience 2025 - Chicago, United States Duration: Sep 15 2025 → Sep 18 2025 |
Publication series
| Name | Proceedings - 2025 IEEE International Conference on e-Science, eScience 2025 |
|---|
Conference
| Conference | 21st IEEE International Conference on e-Science, eScience 2025 |
|---|---|
| Country/Territory | United States |
| City | Chicago |
| Period | 09/15/25 → 09/18/25 |
Funding
This material is based upon work supported by the U.S. Department of Energy (DOE), Office of Science, Office of Advanced Scientific Computing Research, under Contract No. DE-AC02-06CH11357. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.
Keywords
- Agentic AI
- Agentic Systems
- Agentic workflows
- Reliability
- Safety