Operation Optimization Using Reinforcement Learning with Integrated Artificial Reasoning Framework

Junyung Kim, Daniel Mikkelson, Xinyan Wang, Xingang Zhao, Hyun Gook Kang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In large and complex systems, operational decision-making requires a systematic analysis with a vast amount of data from both process parameters and component status monitoring. In this pape we present an integrated artificial reasoning approach for system state transition models that can help operational decision-making with explainable and traceable reasoning. The integrated artificial reasoning framework is a physics-based approach of defining the system structure in a Bayesian network, so we leveraged it in a Markov decision process (MDP) for finding optimal operational solutions. In our proposed framework, the MDP is implemented on a dynamic Bayesian network (DBN), which represents causalities in a system. The multilevel flow modelin was utilized in order to extract these causalities in a more efficient and objective manner. Since multilevel flow modeling is based on the fundamental energy and mass conservation laws, the target system is decomposed into several mass, energy, and information structures, which serve a the basis for a DBN. The MDP consists of the processes of finding a solution for the Bellman equation, which can be derived from the conditional probability equations of the constructed DBN. System operators can capture stochastic system dynamics as multiple subsystem state transitions based on their physical relations and uncertainties coming from the component degradation process or random failures. We analyzed a simplified example system to illustrate finding an optimal operational policy with this approach.

Original languageEnglish
Title of host publicationProceedings of 13th Nuclear Plant Instrumentation, Control and Human-Machine Interface Technologies, NPIC and HMIT 2023
PublisherAmerican Nuclear Society
Pages1678-1687
Number of pages10
ISBN (Electronic)9780894487910
DOIs
StatePublished - 2023
Event13th Nuclear Plant Instrumentation, Control and Human-Machine Interface Technologies, NPIC and HMIT 2023 - Knoxville, United States
Duration: Jul 15 2023Jul 20 2023

Publication series

NameProceedings of 13th Nuclear Plant Instrumentation, Control and Human-Machine Interface Technologies, NPIC and HMIT 2023

Conference

Conference13th Nuclear Plant Instrumentation, Control and Human-Machine Interface Technologies, NPIC and HMIT 2023
Country/TerritoryUnited States
CityKnoxville
Period07/15/2307/20/23

Keywords

  • Markov decision process
  • Modelica
  • Reinforcement learning
  • high-temperature gas reactor

Fingerprint

Dive into the research topics of 'Operation Optimization Using Reinforcement Learning with Integrated Artificial Reasoning Framework'. Together they form a unique fingerprint.

Cite this