Psychopharmacology (Berl). 2024 Aug;241(8):1631-1644.
doi: 10.1007/s00213-024-06586-w. Epub 2024 Apr 10.

5-HT2A and 5-HT2C receptor antagonism differentially modulate reinforcement learning and cognitive flexibility: behavioural and computational evidence

Mona El-Sayed Hervig et al. Psychopharmacology (Berl). 2024 Aug.

Abstract

Rationale: Cognitive flexibility, the ability to adapt behaviour in response to a changing environment, is disrupted in several neuropsychiatric disorders, including obsessive-compulsive disorder and major depressive disorder. Evidence suggests that flexibility, which can be operationalised using reversal learning tasks, is modulated by serotonergic transmission. However, how exactly flexible behaviour and associated reinforcement learning (RL) processes are modulated by 5-HT action on specific receptors is unknown.

Objectives: We investigated the effects of 5-HT2A receptor (5-HT2AR) and 5-HT2C receptor (5-HT2CR) antagonism on flexibility and underlying RL mechanisms.

Methods: Thirty-six male Lister hooded rats were trained on a touchscreen visual discrimination and reversal task. We evaluated the effects of systemic treatments with the 5-HT2AR and 5-HT2CR antagonists M100907 and SB-242084, respectively, on reversal learning and performance on probe trials where correct and incorrect stimuli were presented with a third, probabilistically rewarded, stimulus. Computational models were fitted to task choice data to extract RL parameters, including a novel model designed specifically for this task.

Results: 5-HT2AR antagonism impaired reversal learning only after an initial perseverative phase, during a period of random choice and then new learning. 5-HT2CR antagonism, on the other hand, impaired learning from positive feedback. RL models further differentiated these effects. 5-HT2AR antagonism decreased the punishment learning rate (i.e. the rate of learning from negative feedback) at high and low doses. The low dose also decreased reinforcement sensitivity (beta) and increased stimulus and side stickiness (i.e. the tendency to repeat a choice regardless of outcome). 5-HT2CR antagonism also decreased beta, but reduced side stickiness.
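To make the RL parameters named above concrete, the following is a minimal illustrative sketch of how separate reward and punishment learning rates, reinforcement sensitivity (beta), and stimulus and side stickiness can enter a two-choice model. It is not the authors' model: the function names, the dictionary-based trial layout, and the exact form of the softmax are assumptions made for illustration, and the discount factor reported for one of the winning models is omitted.

```python
import math


def choice_probs(q, layout, beta, kappa_stim, kappa_side,
                 last_stim=None, last_side=None):
    """Softmax choice probabilities for one trial (illustrative only).

    q          : dict mapping stimulus name -> current value estimate
    layout     : dict mapping side ("left"/"right") -> stimulus shown there
    beta       : reinforcement sensitivity (scales the influence of value)
    kappa_stim : stimulus stickiness (bonus for repeating the last stimulus)
    kappa_side : side stickiness (bonus for repeating the last side)
    """
    logits = {}
    for side, stim in layout.items():
        logits[side] = (beta * q[stim]
                        + kappa_stim * (stim == last_stim)
                        + kappa_side * (side == last_side))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(logits.values())
    exps = {side: math.exp(v - top) for side, v in logits.items()}
    total = sum(exps.values())
    return {side: e / total for side, e in exps.items()}


def update_value(q, stim, reward, alpha_rew, alpha_pun):
    """Update the chosen stimulus value with outcome-specific learning rates.

    alpha_rew is used after a rewarded trial (positive feedback) and
    alpha_pun after an unrewarded trial (negative feedback).
    """
    alpha = alpha_rew if reward > 0 else alpha_pun
    q[stim] += alpha * (reward - q[stim])
    return q
```

In this sketch, a lower `alpha_pun` means values change less after unrewarded choices, a lower `beta` pushes choices towards chance regardless of learned values, and positive `kappa_stim` or `kappa_side` biases the model towards repeating its previous stimulus or side whatever the outcome was.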

Conclusions: These data indicate that 5-HT2ARs and 5-HT2CRs modulate different aspects of flexibility: 5-HT2ARs modulate learning from negative feedback, as measured using RL parameters, whereas 5-HT2CRs modulate learning from positive feedback, as assessed through conventional measures.

Keywords: 5-HT receptors; Cognitive flexibility; Punishment learning; Reinforcement learning; Reward learning; Stickiness.

Conflict of interest statement

JWD has received research grants from Boehringer Ingelheim Pharma GmbH and GlaxoSmithKline and receives royalties from Springer Verlag. TWR discloses consultancy with Cambridge Cognition; he receives editorial honoraria from Springer-Nature and Elsevier and a research grant from Shionogi. RNC consults for Campden Instruments and receives royalties from Cambridge Enterprise, Routledge, and Cambridge University Press. KZ, MEH, JA, SFO, TB and BP have no conflicts to declare.

Figures

Fig. 1
Experimental design. (A) Table of groups and treatments (N, number of subjects). (B) VPVD stages and stimuli in the M100907 and SB-242084 experiments. A is the 100% reinforced stimulus, B is the 0% reinforced stimulus, C is reinforced on 50% of probe trials. (C) Experimental timeline, including pretraining and experimental weeks. (disc., discrimination; VPVD, valence-probe visual discrimination (1 and 2); VS, visual stimulus pair (1, 2 and 3); VS2, visual stimulus pair 2; VS3, visual stimulus pair 3)
Fig. 2
Effects of M100907 on VPVD parameters. (A) Percent correct and percent optimal choice across sessions. (B) Errors to criterion and errors per phase. Results are represented as mean ± standard error of the mean (SEM); *** p < 0.01, # p < 0.1
Fig. 3
Results from the hierarchical Bayesian winning RL model 9, showing differences in group mean parameters following M100907 administration. (A) Reward and punishment learning rate parameters. (B) Reinforcement sensitivity, side and stimulus stickiness parameters. (C) Discount factor. (LOW, low dose; HIGH, high dose; CON, vehicle; Reinf., reinforcement; HDI, highest posterior density interval. Red indicates 0 ∉ 95% HDI; orange indicates 0 ∉ 75% HDI)
Fig. 4
Effects of SB-242084 on VPVD parameters. (A) Percent correct and percent optimal choice across sessions. (B) Errors to criterion and errors per phase. Results are represented as mean ± SEM; *** p < 0.01, # p < 0.1
Fig. 5
Results from the hierarchical Bayesian winning RL model 7, showing differences in group mean parameters following SB-242084 administration. (A) Reward and punishment learning rate parameters. (B) Reinforcement sensitivity and side stickiness parameters. (LOW, low dose; HIGH, high dose; CON, vehicle; Reinf., reinforcement; HDI, highest posterior density interval. Red indicates 0 ∉ 95% HDI; orange indicates 0 ∉ 75% HDI)
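
The captions for Figs. 3 and 5 flag a group difference when zero lies outside the 95% or 75% HDI. As a rough sketch of what that check involves, the snippet below finds the narrowest interval containing a given share of posterior samples and tests whether it excludes zero. The function names and the sample-based approach are assumptions for illustration; the source does not state how the authors computed their intervals.

```python
import math


def hdi(samples, mass=0.95):
    """Return the narrowest interval holding roughly `mass` of the samples.

    `samples` is a sequence of posterior draws, for example draws of the
    difference between a drug group mean and the vehicle group mean.
    """
    xs = sorted(samples)
    n = len(xs)
    k = max(1, math.ceil(mass * n))  # number of draws the interval must cover
    # Slide a window of k consecutive sorted draws and keep the narrowest one.
    start = min(range(n - k + 1), key=lambda i: xs[i + k - 1] - xs[i])
    return xs[start], xs[start + k - 1]


def excludes_zero(samples, mass=0.95):
    """True when zero falls outside the HDI of the given mass."""
    lower, upper = hdi(samples, mass)
    return not (lower <= 0.0 <= upper)
```

With draws of a group difference in hand, `excludes_zero(draws, 0.95)` would correspond to the red marking in the captions and `excludes_zero(draws, 0.75)` to the orange marking, under the assumptions stated above.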
