. 2022 Jun 22:11:e74149.

doi: 10.7554/eLife.74149.

Different brain systems support learning from received and avoided pain during human pain-avoidance learning

Marieke Jepma^{1

2

3}, Mathieu Roy^{4

5}, Kiran Ramlakhan^{2

6}, Monique van Velzen⁷, Albert Dahan⁷

Affiliations

¹ Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.
² Department of Psychology, Leiden University, Leiden, Netherlands.
³ Leiden Institute for Brain and Cognition, Leiden, Netherlands.
⁴ Department of Psychology, McGill University, Montreal, Canada.
⁵ Alan Edwards Centre for Research on Pain, McGill University, Montreal, Canada.
⁶ Department of Research and Statistics, Municipality of Amsterdam, Amsterdam, Netherlands.
⁷ Department of Anesthesiology, Leiden University Medical Center, Leiden, Netherlands.

PMID: 35731646
PMCID: PMC9217130
DOI: 10.7554/eLife.74149

Different brain systems support learning from received and avoided pain during human pain-avoidance learning

Marieke Jepma et al. Elife. 2022.

. 2022 Jun 22:11:e74149.

doi: 10.7554/eLife.74149.

Authors

Marieke Jepma^{1

2

3}, Mathieu Roy^{4

5}, Kiran Ramlakhan^{2

6}, Monique van Velzen⁷, Albert Dahan⁷

Affiliations

¹ Department of Psychology, University of Amsterdam, Amsterdam, Netherlands.
² Department of Psychology, Leiden University, Leiden, Netherlands.
³ Leiden Institute for Brain and Cognition, Leiden, Netherlands.
⁴ Department of Psychology, McGill University, Montreal, Canada.
⁵ Alan Edwards Centre for Research on Pain, McGill University, Montreal, Canada.
⁶ Department of Research and Statistics, Municipality of Amsterdam, Amsterdam, Netherlands.
⁷ Department of Anesthesiology, Leiden University Medical Center, Leiden, Netherlands.

PMID: 35731646
PMCID: PMC9217130
DOI: 10.7554/eLife.74149

Abstract

Both unexpected pain and unexpected pain absence can drive avoidance learning, but whether they do so via shared or separate neural and neurochemical systems is largely unknown. To address this issue, we combined an instrumental pain-avoidance learning task with computational modeling, functional magnetic resonance imaging (fMRI), and pharmacological manipulations of the dopaminergic (100 mg levodopa) and opioidergic (50 mg naltrexone) systems (N = 83). Computational modeling provided evidence that untreated participants learned more from received than avoided pain. Our dopamine and opioid manipulations negated this learning asymmetry by selectively increasing learning rates for avoided pain. Furthermore, our fMRI analyses revealed that pain prediction errors were encoded in subcortical and limbic brain regions, whereas no-pain prediction errors were encoded in frontal and parietal cortical regions. However, we found no effects of our pharmacological manipulations on the neural encoding of prediction errors. Together, our results suggest that human pain-avoidance learning is supported by separate threat- and safety-learning systems, and that dopamine and endogenous opioids specifically regulate learning from successfully avoided pain.

Keywords: computational modeling; dopamine; endogenous opioids; fMRI; human; neuroscience; pain-avoidance learning.

PubMed Disclaimer

Conflict of interest statement

MJ, MR, KR, Mv, AD No competing interests declared

Figures

**Figure 1.. Pain-avoidance learning task.**
(A) Outline of one trial. (B) Example of pain probabilities and choice data for one participant. The green and blue lines show the trial-specific probabilities of receiving a painful heat stimulus when choosing each option. The green and blue circles below the graph indicate the participant’s choices, and the red stars indicate trials on which pain was delivered. (C) Probability of switching per treatment group, as a function of pain 1–6 trials back. Error bars are standard errors.

**Figure 1—figure supplement 2.. Pain ratings during a pain-rating task that preceded the pain-avoidance learning task, as a function of stimulus temperature and treatment group.**
Error bars indicate standard errors. Participants received five 47°C, five 49°C, and five 50°C heat stimuli, in random order, to their left lower leg (ramp rate = 40°C/s; 1 s at peak temperature, stimulus onset asynchrony = 17–25 s). Following each heat stimulus participants rated their experienced pain on a 100-unit visual analog scale with anchors of ‘no pain’ and ‘worst-imaginable pain’, respectively. We conducted a mixed analysis of variance (ANOVA) on participants’ pain ratings with stimulus temperature (47, 49, and 50°C coded as −1, 0, and 1, respectively) as within-subject factor and treatment as between-subject factor. Pain ratings increased as a function of stimulus temperature (F(1,84) = 424, p < 0.001). However, there was no main effect of treatment (F(2,84) = 0.21, p = 0.81) and no temperature × treatment interaction (F(2,84) = 0.76, p = 0.47). Thus, the drugs did not affect the subjective pain experience evoked by heat-pain stimuli. Note that four participants who were included in this analysis were excluded from the functional magnetic resonance imaging (fMRI) analyses of the pain-avoidance learning task, because of excessive head movement.

**Figure 2.. Model parameters.**
(A) Posterior distributions of the parameters’ group-level means for each group (left and right panels). Parameters $α_{n o - p a i n}$ and $α_{p a i n}$ are learning rates for avoided and received pain outcomes, respectively; parameter $β$ is the inverse-temperature parameter. The middle panels are joint density plots of ${\bar{α}}_{p a i n}$ and ${\bar{α}}_{n o - p a i n}$ (dots are samples from the Markov chain Monte Carlo [MCMC]), showing that ${\bar{α}}_{p a i n}$ is reliably greater than ${\bar{α}}_{n o - p a i n}$ in the placebo group only. (B) The difference between the posterior distributions for each drug group vs. the placebo group, showing that ${\bar{α}}_{n o - p a i n}$ is greater and $\bar{β}$ is smaller in both drug groups compared to the placebo group. Red lines indicate 95% highest density intervals (HDIs).

**Figure 2—figure supplement 1.. The 95% highest density intervals (HDIs) of the posterior distributions of each participant’s learning rate for pain ( αpain) and no-pain (αno−pain) outcomes.**
Participants are sorted according to the difference between their two learning rates ( $α_{p a i n}$ − $α_{n o - p a i n}$ ). Note that $α_{p a i n}$ and $α_{n o - p a i n}$ were positively correlated in the levodopa group (r = 0.43, p = 0.03), but were not correlated in the placebo (r = −0.16, p = 0.4) and naltrexone (r = 0.31, p = 0.10) group.

Figure 2—figure supplement 2.. Modeling results from an independent sample of untreated participants from a previous study (N = 23), replicating the asymmetric learning rates (α-pain > α-no-pain) found in our placebo group.

**Figure 3.. Outcome-specific prediction-error signals (N = 74).**
(A) Activation tracking surprise more for received than avoided pain (yellow) and vice versa (blue). Note that this includes activation that tracks expected pain probability across both outcomes. Expected P(pain) = expected pain probability. (B) Activation tracking surprise for both received and avoided pain (i.e., absolute prediction error). Activation maps in A and B are thresholded at q < 0.05, false discovery rate (FDR) corrected for multiple comparisons across the whole brain. (C) Regions encoding surprise more for received than avoided pain, which cannot be explained by a general sensitivity to expected pain probability. These regions showed positive activation for both the first (A) and second (B) contrast, each thresholded at q < 0.05, FDR corrected. (D) Regions encoding surprise more for avoided than received pain, which cannot be explained by a general sensitivity to expected pain probability. These regions showed negative activation for the first, and positive activation for the second contrast, each thresholded at q < 0.05, FDR corrected. The line plots show the mean activity extracted from the brainstem and right amygdala (C) and left dlPFC and parietal (D) clusters per quartile of expected pain probability, illustrating the encoding of outcome-specific prediction errors in these regions. Error bars are standard errors.

Appendix 1—figure 1.. Recovered posterior medians of α-pain , α-no-pain, and β- for fits to datasets that were simulated using the posterior medians from the placebo (black) and drug (purple) groups.
The recovered values of ${\bar{α}}_{p a i n}$ do not differ between the two groups, but recovered ${\bar{α}}_{n o - p a i n}$ is reliably higher and recovered $\bar{β}$ reliably lower in the drug group, mirroring the parameter estimates obtained from fits to the empirical data.

**Appendix 2—figure 1.. Axiomatic tests of brain activation encoding general aversive and appetitive prediction errors (N =74).**
(A) Activation associated with the three axioms for aversive prediction errors in our task. Yellow regions showed the effects illustrated in the left panels, and blue regions showed the reverse effects (i.e., the axioms for appetitive prediction errors). All maps were thresholded at q < 0.05, false discovery rate (FDR) corrected for multiple comparisons across the whole brain, with higher voxel thresholds superimposed for display. (B) Conjunction results. Regions activated for each of the above three contrasts, all thresholded at q < 0.05 FDR corrected. Yellow and blue regions showed positive and negative responses for each contrast, respectively, thus encoded general aversive and appetitive prediction errors.

See this image and copyright information in PMC

References

1. Ahn WY, Vasilev G, Lee SH, Busemeyer JR, Kruschke JK, Bechara A, Vassileva J. Decision-making in stimulant and opiate addicts in protracted abstinence: evidence from computational modeling with pure users. Frontiers in Psychology. 2014;5:849. doi: 10.3389/fpsyg.2014.00849. - DOI - PMC - PubMed
1. Ahn WY, Haines N, Zhang L. Revealing Neurocomputational Mechanisms of Reinforcement Learning and Decision-Making With the hBayesDM Package. Computational Psychiatry (Cambridge, Mass.) 2017;1:24–57. doi: 10.1162/CPSY_a_00002. - DOI - PMC - PubMed
1. Bartra O, McGuire JT, Kable JW. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage. 2013;76:412–427. doi: 10.1016/j.neuroimage.2013.02.063. - DOI - PMC - PubMed
1. Beeler JA, Daw N, Frazier CRM, Zhuang X. Tonic dopamine modulates exploitation of reward learning. Frontiers in Behavioral Neuroscience. 2010;4:170. doi: 10.3389/fnbeh.2010.00170. - DOI - PMC - PubMed
1. Beierholm U, Guitart-Masip M, Economides M, Chowdhury R, Düzel E, Dolan R, Dayan P. Dopamine modulates reward-related vigor. Neuropsychopharmacology. 2013;38:1495–1503. doi: 10.1038/npp.2013.48. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Different brain systems support learning from received and avoided pain during human pain-avoidance learning

Affiliations

Different brain systems support learning from received and avoided pain during human pain-avoidance learning

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources