Adaptive dynamic programming approach to experience-based systems identification and control

George G Lendaris¹

Affiliation

¹ Portland State University, Systems Science Graduate Program, Electrical & Computer Engineering and NW Computational Intelligence Laboratory, SySc, P.O. Box 751 Portland, OR 97207, United States. lendaris@sysc.pdx.edu

PMID: 19632087
DOI: 10.1016/j.neunet.2009.06.021

George G Lendaris. Neural Netw. 2009 Jul-Aug;22(5-6):822-32. doi: 10.1016/j.neunet.2009.06.021. Epub 2009 Jul 2.

Abstract

Humans can draw on experience when selecting control actions for distinct and changing situations, and their selection process becomes faster and more effective as more experience is gained. In contrast, current technological implementations slow down as more knowledge is stored. A novel way of employing Approximate (or Adaptive) Dynamic Programming (ADP) is described that shifts the underlying Adaptive Critic type of Reinforcement Learning method "up a level": away from designing individual (optimal) controllers and toward developing on-line algorithms that efficiently and effectively select designs from a repository of existing controller solutions (perhaps previously developed via application of ADP methods). The resulting approach is called the Higher-Level Learning Algorithm. The approach and its rationale are described, and some examples of its application are given. The notions of context and context discernment are important to understanding the human abilities noted above. These notions are first defined in a manner appropriate to controls and system identification. Then, as a foundation relating to the application arena, a historical view of the phases in the development of the controls field is given, organized by how the notion of 'context' was, or was not, involved in each phase.
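
The abstract does not specify how a design is selected from the repository, so the following is only an illustrative sketch and not the paper's Higher-Level Learning Algorithm. It assumes that a context can be summarized as a small vector of plant parameters, that each stored controller is a simple proportional gain, and that selection is a nearest-context lookup; the names `REPOSITORY`, `select_controller`, and `control_action` are hypothetical.

```python
import math

# Hypothetical repository: each entry pairs a context (assumed here to be a
# vector of plant parameters) with a controller designed for that context.
REPOSITORY = [
    {"context": (1.0, 0.5), "gain": 2.0},
    {"context": (2.0, 0.5), "gain": 3.5},
    {"context": (4.0, 1.0), "gain": 6.0},
]


def select_controller(estimated_context, repository=REPOSITORY):
    """Return the stored entry whose context is closest (Euclidean distance)
    to the estimated context. This stands in for context discernment."""
    return min(
        repository,
        key=lambda entry: math.dist(entry["context"], estimated_context),
    )


def control_action(entry, error):
    """Proportional control law standing in for a stored controller design."""
    return entry["gain"] * error


# Usage: the estimated context (1.8, 0.6) is closest to the second entry.
chosen = select_controller((1.8, 0.6))
action = control_action(chosen, error=0.2)
```

A nearest-context lookup is used only because it is the simplest way to show a selector operating over existing designs rather than synthesizing a new controller. A learning-based selector, as the abstract suggests, would replace `select_controller` with a policy that improves with experience.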
