TY - GEN
T1 - Actor-Critic Algorithm for Optimal Synchronization of Kuramoto Oscillator
AU - Vrushabh, D.
AU - Shalini, K.
AU - Sonam, K.
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/6/29
Y1 - 2020/6/29
N2 - This paper constructs a reinforcement learning (RL) based algorithm of Actor-Critic (AC) for the optimal synchronism of the Kuramoto oscillator. This is accomplished through the Ott-Antonsen ansatz framework for the dynamics of large interactive unit networks. Besides, this approach reduces the infinite-dimensional dynamics to phase space flow, i.e., low dimensional dynamics for certain systems of globally coupled phase oscillators. The resulting Hamiltonian-Jacobi-Bellman (HJB) expression is extremely difficult to solve in general, therefore this paper introduces the AC method for learning approximate optimal control laws for the Kuramoto oscillator model. RL has been contemplated as one of the efficient methods to solve optimal control of non-linear systems. For a collection of non-homogeneous oscillators, the states are elucidated as phase angles, which is the modification of the model for a coupled Kuramoto oscillator. An admissible initial control policy for the Kuramoto oscillator model is designed and solved using RL giving an approximate solution of the optimal control problem. Finally, local synchronism of the coupled Kuramoto oscillator model is supported through simulations analysis.
AB - This paper constructs a reinforcement learning (RL) based algorithm of Actor-Critic (AC) for the optimal synchronism of the Kuramoto oscillator. This is accomplished through the Ott-Antonsen ansatz framework for the dynamics of large interactive unit networks. Besides, this approach reduces the infinite-dimensional dynamics to phase space flow, i.e., low dimensional dynamics for certain systems of globally coupled phase oscillators. The resulting Hamiltonian-Jacobi-Bellman (HJB) expression is extremely difficult to solve in general, therefore this paper introduces the AC method for learning approximate optimal control laws for the Kuramoto oscillator model. RL has been contemplated as one of the efficient methods to solve optimal control of non-linear systems. For a collection of non-homogeneous oscillators, the states are elucidated as phase angles, which is the modification of the model for a coupled Kuramoto oscillator. An admissible initial control policy for the Kuramoto oscillator model is designed and solved using RL giving an approximate solution of the optimal control problem. Finally, local synchronism of the coupled Kuramoto oscillator model is supported through simulations analysis.
KW - Approximate Dynamic Programming
KW - Hamilton-Jacobi-Bellman
KW - Kuramoto oscillator
KW - Mean-field game
KW - Order parameter
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85098288507&partnerID=8YFLogxK
U2 - 10.1109/CoDIT49905.2020.9263785
DO - 10.1109/CoDIT49905.2020.9263785
M3 - Conference contribution
AN - SCOPUS:85098288507
T3 - 7th International Conference on Control, Decision and Information Technologies, CoDIT 2020
SP - 391
EP - 396
BT - 7th International Conference on Control, Decision and Information Technologies, CoDIT 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Control, Decision and Information Technologies, CoDIT 2020
Y2 - 29 June 2020 through 2 July 2020
ER -