Neural Network-Based Solutions for Stochastic Optimal Control Using Path Integrals
- PMID: 28212072
- DOI: 10.1109/TNNLS.2016.2544787
Neural Network-Based Solutions for Stochastic Optimal Control Using Path Integrals
Abstract
In this paper, an offline approximate dynamic programming approach using neural networks is proposed for solving a class of finite horizon stochastic optimal control problems. There are two approaches available in the literature, one based on stochastic maximum principle (SMP) formalism and the other based on solving the stochastic Hamilton-Jacobi-Bellman (HJB) equation. However, in the presence of noise, the SMP formalism becomes complex and results in having to solve a couple of backward stochastic differential equations. Hence, current solution methodologies typically ignore the noise effect. On the other hand, the inclusion of noise in the HJB framework is very straightforward. Furthermore, the stochastic HJB equation of a control-affine nonlinear stochastic system with a quadratic control cost function and an arbitrary state cost function can be formulated as a path integral (PI) problem. However, due to curse of dimensionality, it might not be possible to utilize the PI formulation for obtaining comprehensive solutions over the entire operating domain. A neural network structure called the adaptive critic design paradigm is used to effectively handle this difficulty. In this paper, a novel adaptive critic approach using the PI formulation is proposed for solving stochastic optimal control problems. The potential of the algorithm is demonstrated through simulation results from a couple of benchmark problems.
Similar articles
-
Design of nonlinear optimal control for chaotic synchronization of coupled stochastic neural networks via Hamilton-Jacobi-Bellman equation.Neural Netw. 2018 Mar;99:166-177. doi: 10.1016/j.neunet.2018.01.003. Epub 2018 Feb 7. Neural Netw. 2018. PMID: 29427843
-
Policy-Iteration-Based Finite-Horizon Approximate Dynamic Programming for Continuous-Time Nonlinear Optimal Control.IEEE Trans Neural Netw Learn Syst. 2023 Sep;34(9):5255-5267. doi: 10.1109/TNNLS.2022.3225090. Epub 2023 Sep 1. IEEE Trans Neural Netw Learn Syst. 2023. PMID: 37015565
-
Finite-Time Adaptive Dynamic Programming for Affine-Form Nonlinear Systems.IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3573-3586. doi: 10.1109/TNNLS.2023.3337387. Epub 2025 Feb 6. IEEE Trans Neural Netw Learn Syst. 2025. PMID: 38060361
-
A policy iteration approach to online optimal control of continuous-time constrained-input systems.ISA Trans. 2013 Sep;52(5):611-21. doi: 10.1016/j.isatra.2013.04.004. Epub 2013 May 24. ISA Trans. 2013. PMID: 23706414
-
Advances in Zeroing Neural Networks: Bio-Inspired Structures, Performance Enhancements, and Applications.Biomimetics (Basel). 2025 Apr 29;10(5):279. doi: 10.3390/biomimetics10050279. Biomimetics (Basel). 2025. PMID: 40422109 Free PMC article. Review.
Publication types
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous