Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

Peter Shih¹, Brian C Kaul, Sarangapani Jagannathan, James A Drallmeier

Affiliations

PMID: 19336317
DOI: 10.1109/TSMCB.2009.2013272

Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

Peter Shih et al. IEEE Trans Syst Man Cybern B Cybern. 2009 Oct.

. 2009 Oct;39(5):1162-79.

doi: 10.1109/TSMCB.2009.2013272. Epub 2009 Mar 24.

Authors

Peter Shih¹, Brian C Kaul, Sarangapani Jagannathan, James A Drallmeier

Affiliation

¹ Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO 65409, USA.

PMID: 19336317
DOI: 10.1109/TSMCB.2009.2013272

Abstract

A novel reinforcement-learning-based output adaptive neural network (NN) controller, which is also referred to as the adaptive-critic NN controller, is developed to deliver the desired tracking performance for a class of nonlinear discrete-time systems expressed in nonstrict feedback form in the presence of bounded and unknown disturbances. The adaptive-critic NN controller consists of an observer, a critic, and two action NNs. The observer estimates the states and output, and the two action NNs provide virtual and actual control inputs to the nonlinear discrete-time system. The critic approximates a certain strategic utility function, and the action NNs minimize the strategic utility function and control inputs. All NN weights adapt online toward minimization of a performance index, utilizing the gradient-descent-based rule, in contrast with iteration-based adaptive-critic schemes. Lyapunov functions are used to show the stability of the closed-loop tracking error, weights, and observer estimates. Separation and certainty equivalence principles, persistency of excitation condition, and linearity in the unknown parameter assumption are not needed. Experimental results on a spark ignition (SI) engine operating lean at an equivalence ratio of 0.75 show a significant (25%) reduction in cyclic dispersion in heat release with control, while the average fuel input changes by less than 1% compared with the uncontrolled case. Consequently, oxides of nitrogen (NO(x)) drop by 30%, and unburned hydrocarbons drop by 16% with control. Overall, NO(x)'s are reduced by over 80% compared with stoichiometric levels.

PubMed Disclaimer

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
- IEEE Engineering in Medicine and Biology Society
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

Affiliation

Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

Authors

Affiliation

Abstract

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources