Review. Neuron. 2021 Nov 17;109(22):3552-3575. doi: 10.1016/j.neuron.2021.09.034. Epub 2021 Oct 21.

The learning of prospective and retrospective cognitive maps within neural circuits


Vijay Mohan K Namboodiri et al. Neuron. 2021.

Abstract

Brain circuits are thought to form a "cognitive map" to process and store statistical relationships in the environment. A cognitive map is commonly defined as a mental representation that describes environmental states (i.e., variables or events) and the relationship between these states. This process is commonly conceptualized as a prospective process, as it is based on the relationships between states in chronological order (e.g., does reward follow a given state?). In this perspective, we expand this concept on the basis of recent findings to postulate that in addition to a prospective map, the brain forms and uses a retrospective cognitive map (e.g., does a given state precede reward?). In doing so, we demonstrate that many neural signals and behaviors (e.g., habits) that seem inflexible and non-cognitive can result from retrospective cognitive maps. Together, we present a significant conceptual reframing of the neurobiological study of associative learning, memory, and decision making.


Figures

Figure 1:
The causal relationship between reward predictors and rewards may be learned prospectively or retrospectively.
Figure 2:
Schematic experiments illustrating prospective and retrospective transition probabilities. In the top experiment, there is a high prospective and retrospective probability between the reward predictor and reward. ITI stands for intertrial interval, i.e., the duration between a reward and the next reward predictor. In the middle experiment, the prospective probability is low, since the cue/action predicts reward only 50% of the time; however, the retrospective probability is high, since every reward is preceded by the cue/action. In the bottom experiment, the prospective probability is high, as every cue/action is followed by a reward, but the retrospective probability is low, since not every reward is preceded by the cue/action.
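The two quantities in this caption can be estimated directly from trial counts. The sketch below uses a hypothetical simplification of the Figure 2 experiments, in which each trial is a (cue, reward) pair; it is an illustration of the two conditional probabilities, not the authors' analysis code.

```python
# Prospective probability: P(reward | cue) -- does reward follow the cue?
# Retrospective probability: P(cue | reward) -- was reward preceded by the cue?
def transition_probabilities(trials):
    """trials: list of (cue_present, reward_present) booleans, one per trial."""
    cue_trials = [t for t in trials if t[0]]
    reward_trials = [t for t in trials if t[1]]
    prospective = sum(r for _, r in cue_trials) / len(cue_trials)
    retrospective = sum(c for c, _ in reward_trials) / len(reward_trials)
    return prospective, retrospective

# Middle experiment of Figure 2: the cue predicts reward only 50% of the
# time, but every reward is preceded by the cue.
middle = [(True, True), (True, False)] * 50
print(transition_probabilities(middle))  # (0.5, 1.0)
```

As in the caption, the two probabilities dissociate: the prospective probability is 0.5 while the retrospective probability is 1.0.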
Figure 3:
Successor and predecessor representations: A. A state space illustrating the key difference between successor (SR) and predecessor (PR) representations. Here, state 1 transitions with 10% probability to state 2, which then transitions with 10% probability to a reward state. Thus, obtaining reward is only possible by starting at state 1, even though the probability of reward when starting at state 1 is extremely low (1%). The challenge for an animal is to learn that the only feasible path to a reward state is by starting in state 1. B. The values of the successor representation to a reward state for states 1 and 2, shown under the assumption of a discount factor of 0.9 (calculated in Appendix 1). These values are very low, reflecting the fact that reward states typically occur far into the future when starting in these states (due to the low transition probabilities to the reward state). Hence, these low values do not highlight that a reward state is feasible only if the animal starts in state 1. C. The retrospective state space for this example, showing that ending up in a reward state means that the previous state was certainly state 2 and that the second previous state was certainly state 1. Thus, a retrospective evaluation makes it clear that a reward state is feasible only if one starts in state 1. D. The predecessor representation of the two states to the reward state. These values are very high compared to the SR and highlight the fact that a reward state is feasible only if the animal starts in state 1. The PR is higher for state 1 because it is a much more frequent state (see text).
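The SR values in panel B can be computed in closed form as M = (I − γT)⁻¹ with γ = 0.9, as in the figure. The full transition matrix below is an assumption (the figure specifies only the 10% forward transitions; here, non-transitioning states stay put and the reward state restarts at state 1), so the resulting numbers are illustrative rather than the paper's Appendix 1 values.

```python
import numpy as np

gamma = 0.9
# States: 0 = state 1, 1 = state 2, 2 = reward state.
T = np.array([[0.9, 0.1, 0.0],   # state 1 -> state 2 with p = 0.1
              [0.0, 0.9, 0.1],   # state 2 -> reward with p = 0.1
              [1.0, 0.0, 0.0]])  # assumed restart after reward
M = np.linalg.inv(np.eye(3) - gamma * T)  # successor representation

# Expected discounted future occupancy of the reward state
# (~0.28 from state 1, ~0.59 from state 2 under these assumptions):
print(M[0, 2], M[1, 2])
```

Both values are small, consistent with the caption's point: low prospective occupancy of the reward state fails to signal that state 1 is the only gateway to reward.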
Figure 4:
Neuronal activity in select OFC neuronal subpopulations is consistent with a representation of the retrospective transition probability. A. Longitudinal tracking of the same neurons across many days using two-photon calcium imaging (reproduced from Namboodiri et al. 2019). Four example neurons are indicated by differently colored arrows. B. Qualitative summary of data from three separate subpopulations of neurons identified by clustering neuronal activity (summarized from Namboodiri et al. 2019). Comparison with Figure 2 shows qualitative correspondence of these groups with representations of prospective and retrospective transition probabilities. C. An additional test of the representation of a retrospective transition probability using extinction of a learned cue-reward pairing. The expected subjective probabilities are shown. D. Anticipatory licking induced by the cue, showing that animals learn the extinction. E. Mean normalized fluorescence of longitudinally tracked OFC→VTA neurons (n=27 in cluster 1, n=23 in cluster 5) plotted against time relative to cue onset. The cue response (between the dashed lines) remains high even after extinction, consistent with the expected subjective retrospective transition probability.
Figure 5:
Representation of retrospective transition probability in a songbird brain: The y-axis measures the response modulation of neurons in area HVC of the Bengalese finch to a syllable (Bouchard and Brainard, 2013). The x-axis measures the retrospective transition probability from that syllable to the preceding sequence in the bird's natural song. An increase in the retrospective transition probability to the preceding stimulus produces a linear increase in the response of HVC neurons. Reproduced with permission (Fig. 4G in the original publication).
Figure 6:
Reconceptualization of the function of several neural circuits. Here, we speculatively propose a reconceptualized framework for the function of several nodes of the neural circuits involved in associative learning. While we present some evidence consistent with this framework in the text, we offer it primarily to stimulate future experimental testing. For simplicity, we omit representations of reward value/magnitude. Further, we are not proposing that the listed functions completely describe a given node; almost certainly, each node is involved in many other functions due to the heterogeneity of its cell types.
Box Fig 1.
Illustration of SR and PR contingencies: A. An example high-dimensional state space. All prospective transition probabilities are denoted by the corresponding arrows. B. An intuitive interpretation of the state space in A. Since the only path to reward goes through state 1, state 1 is the most important state around which to organize learning. C. The SR and PR of all states to the reward state. Here, the discount factor was set to 0.99. Neither the SR nor the PR magnitudes highlight the fact that state 1 is the most important state on the path to reward. Note that the PR values here mostly reflect how frequent each state is, with state 5 being the most frequent. This is also why the mean SR value is much lower than the mean PR value: the mean SR value reflects the relative frequency of the reward state. D. SR and PR contingencies of all states to the reward state. These quantities account for the relative frequencies of all states. The SR contingency measures how much more frequently a given state occurs after reward compared with a random state; the PR contingency measures how much more frequently a given state occurs before reward compared with a random state. The PR contingency quantitatively captures all the important intuitive observations in B regarding this state space.
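One standard way to formalize the PR is as the SR of the time-reversed Markov chain, whose transitions follow from Bayes' rule and the stationary distribution. The sketch below uses a hypothetical 3-state chain standing in for the Box Fig 1 state space, with γ = 0.99 as in panel C; the actual state space and the paper's exact contingency formula are not reproduced here.

```python
import numpy as np

gamma = 0.99
# States: 0 = state 1, 1 = state 2, 2 = reward state (assumed dynamics).
T = np.array([[0.9, 0.1, 0.0],
              [0.0, 0.9, 0.1],
              [1.0, 0.0, 0.0]])

# Stationary distribution pi solves pi @ T = pi (left eigenvector of T
# with eigenvalue 1, normalized to sum to 1).
evals, evecs = np.linalg.eig(T.T)
pi = np.real(evecs[:, np.argmin(np.abs(evals - 1.0))])
pi = pi / pi.sum()

# Time-reversed transitions: T_rev[s', s] = pi[s] * T[s, s'] / pi[s'].
T_rev = (pi[None, :] * T.T) / pi[:, None]
# PR as the SR of the reversed chain.
PR = np.linalg.inv(np.eye(3) - gamma * T_rev)

# Every reward is preceded by state 2 (T_rev[2, 1] == 1), even though
# the reward state itself is rare (pi[2] is small).
print(T_rev[2], pi)
```

A contingency along the lines described in panel D could then compare each column of PR against the baseline state frequencies pi, so that frequent states such as state 5 in the box figure do not dominate; the paper's exact normalization is not reproduced here.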
