Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 22:11:e74149.
doi: 10.7554/eLife.74149.

Different brain systems support learning from received and avoided pain during human pain-avoidance learning

Affiliations

Different brain systems support learning from received and avoided pain during human pain-avoidance learning

Marieke Jepma et al. Elife. .

Abstract

Both unexpected pain and unexpected pain absence can drive avoidance learning, but whether they do so via shared or separate neural and neurochemical systems is largely unknown. To address this issue, we combined an instrumental pain-avoidance learning task with computational modeling, functional magnetic resonance imaging (fMRI), and pharmacological manipulations of the dopaminergic (100 mg levodopa) and opioidergic (50 mg naltrexone) systems (N = 83). Computational modeling provided evidence that untreated participants learned more from received than avoided pain. Our dopamine and opioid manipulations negated this learning asymmetry by selectively increasing learning rates for avoided pain. Furthermore, our fMRI analyses revealed that pain prediction errors were encoded in subcortical and limbic brain regions, whereas no-pain prediction errors were encoded in frontal and parietal cortical regions. However, we found no effects of our pharmacological manipulations on the neural encoding of prediction errors. Together, our results suggest that human pain-avoidance learning is supported by separate threat- and safety-learning systems, and that dopamine and endogenous opioids specifically regulate learning from successfully avoided pain.

Keywords: computational modeling; dopamine; endogenous opioids; fMRI; human; neuroscience; pain-avoidance learning.

PubMed Disclaimer

Conflict of interest statement

MJ, MR, KR, Mv, AD No competing interests declared

Figures

Figure 1.
Figure 1.. Pain-avoidance learning task.
(A) Outline of one trial. (B) Example of pain probabilities and choice data for one participant. The green and blue lines show the trial-specific probabilities of receiving a painful heat stimulus when choosing each option. The green and blue circles below the graph indicate the participant’s choices, and the red stars indicate trials on which pain was delivered. (C) Probability of switching per treatment group, as a function of pain 1–6 trials back. Error bars are standard errors.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Pre- and post-treatment ratings of alertness, calmness, and contentment in each treatment group.
Error bars indicate standard errors. The participants assigned to the three treatment groups did not differ in their pretreatment ratings of alertness, calmness, or contentment (all p’s > 0.3). To assess potential treatment effects on subjective state we conducted analyses of covariance (ANCOVAs) on the post-treatment ratings of alertness, calmness, and contentment (made 2 hr after drug intake), with treatment as a between-subject factor and the pretreatment ratings as covariate. There was no effect of treatment on any of these ratings (all p’s > 0.08), suggesting that the drugs did not affect subjective state.
Figure 1—figure supplement 2.
Figure 1—figure supplement 2.. Pain ratings during a pain-rating task that preceded the pain-avoidance learning task, as a function of stimulus temperature and treatment group.
Error bars indicate standard errors. Participants received five 47°C, five 49°C, and five 50°C heat stimuli, in random order, to their left lower leg (ramp rate = 40°C/s; 1 s at peak temperature, stimulus onset asynchrony = 17–25 s). Following each heat stimulus participants rated their experienced pain on a 100-unit visual analog scale with anchors of ‘no pain’ and ‘worst-imaginable pain’, respectively. We conducted a mixed analysis of variance (ANOVA) on participants’ pain ratings with stimulus temperature (47, 49, and 50°C coded as −1, 0, and 1, respectively) as within-subject factor and treatment as between-subject factor. Pain ratings increased as a function of stimulus temperature (F(1,84) = 424, p < 0.001). However, there was no main effect of treatment (F(2,84) = 0.21, p = 0.81) and no temperature × treatment interaction (F(2,84) = 0.76, p = 0.47). Thus, the drugs did not affect the subjective pain experience evoked by heat-pain stimuli. Note that four participants who were included in this analysis were excluded from the functional magnetic resonance imaging (fMRI) analyses of the pain-avoidance learning task, because of excessive head movement.
Figure 2.
Figure 2.. Model parameters.
(A) Posterior distributions of the parameters’ group-level means for each group (left and right panels). Parameters αno-pain and αpain are learning rates for avoided and received pain outcomes, respectively; parameter β is the inverse-temperature parameter. The middle panels are joint density plots of α-pain and α-no-pain (dots are samples from the Markov chain Monte Carlo [MCMC]), showing that α-pain is reliably greater than α-no-pain in the placebo group only. (B) The difference between the posterior distributions for each drug group vs. the placebo group, showing that α-no-pain is greater and β- is smaller in both drug groups compared to the placebo group. Red lines indicate 95% highest density intervals (HDIs).
Figure 2—figure supplement 1.
Figure 2—figure supplement 1.. The 95% highest density intervals (HDIs) of the posterior distributions of each participant’s learning rate for pain ( αpain) and no-pain (αnopain) outcomes.
Participants are sorted according to the difference between their two learning rates (αpainαno-pain). Note that αpain and αno-pain were positively correlated in the levodopa group (r = 0.43, p = 0.03), but were not correlated in the placebo (r = −0.16, p = 0.4) and naltrexone (r = 0.31, p = 0.10) group.
Figure 2—figure supplement 2.
Figure 2—figure supplement 2.. Modeling results from an independent sample of untreated participants from a previous study (N = 23), replicating the asymmetric learning rates (α-pain > α-no-pain) found in our placebo group.
Figure 3.
Figure 3.. Outcome-specific prediction-error signals (N = 74).
(A) Activation tracking surprise more for received than avoided pain (yellow) and vice versa (blue). Note that this includes activation that tracks expected pain probability across both outcomes. Expected P(pain) = expected pain probability. (B) Activation tracking surprise for both received and avoided pain (i.e., absolute prediction error). Activation maps in A and B are thresholded at q < 0.05, false discovery rate (FDR) corrected for multiple comparisons across the whole brain. (C) Regions encoding surprise more for received than avoided pain, which cannot be explained by a general sensitivity to expected pain probability. These regions showed positive activation for both the first (A) and second (B) contrast, each thresholded at q < 0.05, FDR corrected. (D) Regions encoding surprise more for avoided than received pain, which cannot be explained by a general sensitivity to expected pain probability. These regions showed negative activation for the first, and positive activation for the second contrast, each thresholded at q < 0.05, FDR corrected. The line plots show the mean activity extracted from the brainstem and right amygdala (C) and left dlPFC and parietal (D) clusters per quartile of expected pain probability, illustrating the encoding of outcome-specific prediction errors in these regions. Error bars are standard errors.
Appendix 1—figure 1.
Appendix 1—figure 1.. Recovered posterior medians of α-pain , α-no-pain, and β- for fits to datasets that were simulated using the posterior medians from the placebo (black) and drug (purple) groups.
The recovered values of α-pain do not differ between the two groups, but recovered α-no-pain is reliably higher and recovered β- reliably lower in the drug group, mirroring the parameter estimates obtained from fits to the empirical data.
Appendix 2—figure 1.
Appendix 2—figure 1.. Axiomatic tests of brain activation encoding general aversive and appetitive prediction errors (N =74).
(A) Activation associated with the three axioms for aversive prediction errors in our task. Yellow regions showed the effects illustrated in the left panels, and blue regions showed the reverse effects (i.e., the axioms for appetitive prediction errors). All maps were thresholded at q < 0.05, false discovery rate (FDR) corrected for multiple comparisons across the whole brain, with higher voxel thresholds superimposed for display. (B) Conjunction results. Regions activated for each of the above three contrasts, all thresholded at q < 0.05 FDR corrected. Yellow and blue regions showed positive and negative responses for each contrast, respectively, thus encoded general aversive and appetitive prediction errors.

Similar articles

Cited by

References

    1. Ahn WY, Vasilev G, Lee SH, Busemeyer JR, Kruschke JK, Bechara A, Vassileva J. Decision-making in stimulant and opiate addicts in protracted abstinence: evidence from computational modeling with pure users. Frontiers in Psychology. 2014;5:849. doi: 10.3389/fpsyg.2014.00849. - DOI - PMC - PubMed
    1. Ahn WY, Haines N, Zhang L. Revealing Neurocomputational Mechanisms of Reinforcement Learning and Decision-Making With the hBayesDM Package. Computational Psychiatry (Cambridge, Mass.) 2017;1:24–57. doi: 10.1162/CPSY_a_00002. - DOI - PMC - PubMed
    1. Bartra O, McGuire JT, Kable JW. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage. 2013;76:412–427. doi: 10.1016/j.neuroimage.2013.02.063. - DOI - PMC - PubMed
    1. Beeler JA, Daw N, Frazier CRM, Zhuang X. Tonic dopamine modulates exploitation of reward learning. Frontiers in Behavioral Neuroscience. 2010;4:170. doi: 10.3389/fnbeh.2010.00170. - DOI - PMC - PubMed
    1. Beierholm U, Guitart-Masip M, Economides M, Chowdhury R, Düzel E, Dolan R, Dayan P. Dopamine modulates reward-related vigor. Neuropsychopharmacology. 2013;38:1495–1503. doi: 10.1038/npp.2013.48. - DOI - PMC - PubMed

Publication types