The rational use of causal inference to guide reinforcement learning strengthens with age

Alexandra O Cohen¹, Kate Nussenbaum¹, Hayley M Dorfman², Samuel J Gershman^{2

3}, Catherine A Hartley^{1

4}

Affiliations

¹ Department of Psychology, New York University, New York, NY 10003 USA.
² Department of Psychology, Harvard University, Cambridge, MA 02138 USA.
³ Center for Brain Science, Harvard University, Cambridge, MA 02138 USA.
⁴ Center for Neural Science, New York University, New York, NY 1003 USA.

PMID: 33133638
PMCID: PMC7591882
DOI: 10.1038/s41539-020-00075-3

The rational use of causal inference to guide reinforcement learning strengthens with age

Alexandra O Cohen et al. NPJ Sci Learn. 2020.

. 2020 Oct 27:5:16.

doi: 10.1038/s41539-020-00075-3. eCollection 2020.

Authors

Alexandra O Cohen¹, Kate Nussenbaum¹, Hayley M Dorfman², Samuel J Gershman^{2

3}, Catherine A Hartley^{1

4}

Affiliations

¹ Department of Psychology, New York University, New York, NY 10003 USA.
² Department of Psychology, Harvard University, Cambridge, MA 02138 USA.
³ Center for Brain Science, Harvard University, Cambridge, MA 02138 USA.
⁴ Center for Neural Science, New York University, New York, NY 1003 USA.

PMID: 33133638
PMCID: PMC7591882
DOI: 10.1038/s41539-020-00075-3

Abstract

Beliefs about the controllability of positive or negative events in the environment can shape learning throughout the lifespan. Previous research has shown that adults' learning is modulated by beliefs about the causal structure of the environment such that they update their value estimates to a lesser extent when the outcomes can be attributed to hidden causes. This study examined whether external causes similarly influenced outcome attributions and learning across development. Ninety participants, ages 7 to 25 years, completed a reinforcement learning task in which they chose between two options with fixed reward probabilities. Choices were made in three distinct environments in which different hidden agents occasionally intervened to generate positive, negative, or random outcomes. Participants' beliefs about hidden-agent intervention aligned with the true probabilities of the positive, negative, or random outcome manipulation in each of the three environments. Computational modeling of the learning data revealed that while the choices made by both adults (ages 18-25) and adolescents (ages 13-17) were best fit by Bayesian reinforcement learning models that incorporate beliefs about hidden-agent intervention, those of children (ages 7-12) were best fit by a one learning rate model that updates value estimates based on choice outcomes alone. Together, these results suggest that while children demonstrate explicit awareness of the causal structure of the task environment, they do not implicitly use beliefs about the causal structure of the environment to guide reinforcement learning in the same manner as adolescents and adults.

Keywords: Decision making; Human behaviour.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe authors declare no competing interests.

Figures

**Fig. 1. Experimental design.**
Participants were told that they were mining for gold in three territories visited by three different agents who intervened to generate negative (Robber), positive (Millionaire), or random (Sheriff) outcomes. On each trial, participants selected a mine (Choice), observed the outcome associated with their choice (Feedback), and indicated their belief about hidden agent intervention (Attribution). Elements of this image were designed by Freepik (https://www.freepik.com/) and are licensed for personal use.

**Fig. 2. Attribution data.**
Participant’s beliefs about hidden agent intervention aligned with the experimental manipulation, and attribution rates were higher overall in younger individuals. We also observed an age by reward outcome interaction, indicating that younger individuals attributed positive outcomes to the hidden agent more so than older individuals.

**Fig. 3. Observed and simulated learning data.**
Across territories, older participants learned to select the best mine faster. Younger participants showed better learning in the Millionaire territory (a). Simulated data using subjects’ fitted parameter estimates for each of the best fitting models are depicted (**b–d**). Boxes denote the age group that was best fit by the model. Participants are separated by age group (Children: 7–12, Adolescents: 13–17, Adult: 18–25) and trial bins for visualization purposes. The corresponding statistical analyses on the empirical data treat age and trial as continuous variables.

**Fig. 4. Computational model comparison.**
Children were best fit by a one learning rate model, adolescents were best fit by the adaptive Bayesian model, and adults were best fit by the empirical Bayesian model, as indexed by protected exceedance probabilities (PXPs).

See this image and copyright information in PMC

References

1. Christakou A, et al. Neural and psychological maturation of decision-making in adolescence and young adulthood. J. Cogn. Neurosci. 2013;25:1807–1823. doi: 10.1162/jocn_a_00447. - DOI - PubMed
1. Decker JH, Otto AR, Daw ND, Hartley CA. From creatures of habit to goal-directed learners: tracking the developmental emergence of model-based reinforcement learning. Psychol. Sci. 2016;27:848–858. doi: 10.1177/0956797616639301. - DOI - PMC - PubMed
1. Jones RM, et al. Adolescent-specific patterns of behavior and neural activity during social reinforcement learning. Cogn. Affect. Behav. Neurosci. 2014;14:683–697. doi: 10.3758/s13415-014-0257-z. - DOI - PMC - PubMed
1. Palminteri, S., Kilford, E. J., Coricelli, G. & Blakemore, S. J. The computational development of reinforcement learning during adolescence. PLoS Comput. Biol. 12, e1004953 (2016). - PMC - PubMed
1. van den Bos W, Cohen MX, Kahnt T, Crone EA. Striatum-medial prefrontal cortex connectivity predicts developmental changes in reinforcement learning. Cereb. Cortex. 2012;22:1247–1255. doi: 10.1093/cercor/bhr198. - DOI - PMC - PubMed

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The rational use of causal inference to guide reinforcement learning strengthens with age

Affiliations

The rational use of causal inference to guide reinforcement learning strengthens with age

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources