Interplay of approximate planning strategies
- PMID: 25675480
- PMCID: PMC4364207
- DOI: 10.1073/pnas.1414219112
Interplay of approximate planning strategies
Abstract
Humans routinely formulate plans in domains so complex that even the most powerful computers are taxed. To do so, they seem to avail themselves of many strategies and heuristics that efficiently simplify, approximate, and hierarchically decompose hard tasks into simpler subtasks. Theoretical and cognitive research has revealed several such strategies; however, little is known about their establishment, interaction, and efficiency. Here, we use model-based behavioral analysis to provide a detailed examination of the performance of human subjects in a moderately deep planning task. We find that subjects exploit the structure of the domain to establish subgoals in a way that achieves a nearly maximal reduction in the cost of computing values of choices, but then combine partial searches with greedy local steps to solve subtasks, and maladaptively prune the decision trees of subtasks in a reflexive manner upon encountering salient losses. Subjects come idiosyncratically to favor particular sequences of actions to achieve subgoals, creating novel complex actions or "options."
Keywords: hierarchical reinforcement learning; memoization; planning; pruning.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Comment in
-
How to divide and conquer the world, one step at a time.Proc Natl Acad Sci U S A. 2015 Mar 10;112(10):2929-30. doi: 10.1073/pnas.1500975112. Epub 2015 Mar 2. Proc Natl Acad Sci U S A. 2015. PMID: 25733879 Free PMC article. No abstract available.
References
-
- Sutton RS, Precup D, Singh S. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif Intell. 1999;112:181–211.
-
- Daw ND, Niv Y, Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci. 2005;8(12):1704–1711. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
