Adaptive dynamic programming approach to experience-based systems identification and control
- PMID: 19632087
- DOI: 10.1016/j.neunet.2009.06.021
Abstract
Humans are able to draw on experience when selecting control actions for distinct and changing situations, and this process becomes faster and more effective as more experience is gained. In contrast, current technological implementations slow down as more knowledge is stored. A novel way of employing Approximate (or Adaptive) Dynamic Programming (ADP) is described that shifts the underlying Adaptive Critic type of Reinforcement Learning method "up a level": away from designing individual (optimal) controllers and toward developing on-line algorithms that efficiently and effectively select designs from a repository of existing controller solutions (perhaps previously developed via application of ADP methods). The resulting approach is called the Higher-Level Learning Algorithm. The approach and its rationale are described, and some examples of its application are given. The notions of context and context discernment are important to understanding the human abilities noted above. These notions are first defined in a manner appropriate to control and system identification. As a foundation relating to the application arena, a historical view of the phases in the development of the controls field is then given, organized by how the notion of 'context' was, or was not, involved in each phase.
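The idea of learning to select from a repository of existing controllers, rather than designing each controller anew, can be illustrated with a minimal sketch. This is not the paper's Higher-Level Learning Algorithm: the scalar plant, the proportional gains, the quadratic cost, the assumption that the context index is directly observable, and all names (`episode_cost`, `ControllerSelector`, `plant_params`, `gains`) are hypothetical choices made only for illustration. A simple tabular, epsilon-greedy cost estimate stands in for the adaptive critic.

```python
import random

def episode_cost(a, k, x0=1.0, steps=20):
    """Quadratic cost of gain k on the assumed scalar plant x' = a*x + u, with u = -k*x."""
    x, cost = x0, 0.0
    for _ in range(steps):
        u = -k * x
        cost += x * x + 0.1 * u * u
        x = a * x + u
    return cost

class ControllerSelector:
    """Learns, per context, which stored controller has the lowest observed cost."""

    def __init__(self, n_contexts, n_controllers, epsilon=0.2, seed=0):
        # q[c][i]: running mean cost of controller i in context c; n[c][i]: visit count.
        self.q = [[0.0] * n_controllers for _ in range(n_contexts)]
        self.n = [[0] * n_controllers for _ in range(n_contexts)]
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def select(self, context, explore=True):
        row = self.q[context]
        if explore:
            # Try every stored controller once before trusting the estimates.
            untried = [i for i, count in enumerate(self.n[context]) if count == 0]
            if untried:
                return untried[0]
            if self.rng.random() < self.epsilon:
                return self.rng.randrange(len(row))
        # Greedy choice: lowest estimated cost.
        return min(range(len(row)), key=row.__getitem__)

    def update(self, context, controller, cost):
        # Incremental mean of the observed episode costs.
        self.n[context][controller] += 1
        count = self.n[context][controller]
        self.q[context][controller] += (cost - self.q[context][controller]) / count

# Hypothetical repository: three proportional gains; two contexts (plant parameter a).
plant_params = [0.5, 1.5]
gains = [0.4, 1.0, 1.4]

selector = ControllerSelector(len(plant_params), len(gains))
for episode in range(60):
    context = episode % len(plant_params)  # context assumed to be already discerned
    choice = selector.select(context)
    selector.update(context, choice, episode_cost(plant_params[context], gains[choice]))

# Gain selected for each context once exploration is switched off.
best = [gains[selector.select(c, explore=False)] for c in range(len(plant_params))]
```

In this toy setting the selector's work per decision is a table lookup over stored designs, which is the sense in which selection from experience can stay cheap as the repository is reused; context discernment itself, which the abstract treats as central, is deliberately left out of the sketch.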
Similar articles
- Higher level application of ADP: a next phase for the control field? IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):901-12. doi: 10.1109/TSMCB.2008.918073. PMID: 18632376 Review.
- Improved Adaptive-Reinforcement Learning Control for morphing unmanned air vehicles. IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):1014-20. doi: 10.1109/TSMCB.2008.922018. PMID: 18632393
- Intelligence in the brain: a theory of how it works and how to build it. Neural Netw. 2009 Apr;22(3):200-12. doi: 10.1016/j.neunet.2009.03.012. Epub 2009 Mar 29. PMID: 19386468 Review.
- Adaptive critic learning techniques for engine torque and air-fuel ratio control. IEEE Trans Syst Man Cybern B Cybern. 2008 Aug;38(4):988-93. doi: 10.1109/TSMCB.2008.922019. PMID: 18632389
- Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Trans Syst Man Cybern B Cybern. 2011 Feb;41(1):14-25. doi: 10.1109/TSMCB.2010.2043839. Epub 2010 Mar 29. PMID: 20350860