A stochastic gradient descent approach for stochastic optimal control

Richard Archibald, Feng Bao, Jiongmin Yong

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

In this work, we introduce a stochastic gradient descent approach to solve the stochastic optimal control problem through stochastic maximum principle. The motivation that drives our method is the gradient of the cost functional in the stochastic optimal control problem is under expectation, and numerical calculation of such an expectation requires fully computation of a system of forward backward stochastic differential equations, which is computationally expensive. By evaluating the expectation with single-sample representation as suggested by the stochastic gradient descent type optimisation, we could save computational efforts in solving FBSDEs and only focus on the optimisation task which aims to determine the optimal control process.

Original languageEnglish
Pages (from-to)635-658
Number of pages24
JournalEast Asian Journal on Applied Mathematics
Volume10
Issue number4
DOIs
StatePublished - Nov 2020

Funding

This work is partially supported by the Scientific Discovery through Advanced Computing (SciDAC) program funded by U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research through FASTMath Institute and CompFUSE project. The second author also acknowledges support by U.S. National Science Foundation under Contract DMS-1720222. The third author acknowledges the partial support by NSF grant DMS-1812921.

FundersFunder number
FASTMath Institute
U.S. National Science Foundation
National Science FoundationDMS-1720222, DMS-1812921
U.S. Department of Energy
Office of Science

    Keywords

    • Forward backward stochastic differential equations
    • Maximum principle
    • Stochastic gradient descent
    • Stochastic optimal control

    Fingerprint

    Dive into the research topics of 'A stochastic gradient descent approach for stochastic optimal control'. Together they form a unique fingerprint.

    Cite this