Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jul 7;24(7):102821.
doi: 10.1016/j.isci.2021.102821. eCollection 2021 Jul 23.

Reward boosts reinforcement-based motor learning

Affiliations

Reward boosts reinforcement-based motor learning

Pierre Vassiliadis et al. iScience. .

Abstract

Besides relying heavily on sensory and reinforcement feedback, motor skill learning may also depend on the level of motivation experienced during training. Yet, how motivation by reward modulates motor learning remains unclear. In 90 healthy subjects, we investigated the net effect of motivation by reward on motor learning while controlling for the sensory and reinforcement feedback received by the participants. Reward improved motor skill learning beyond performance-based reinforcement feedback. Importantly, the beneficial effect of reward involved a specific potentiation of reinforcement-related adjustments in motor commands, which concerned primarily the most relevant motor component for task success and persisted on the following day in the absence of reward. We propose that the long-lasting effects of motivation on motor learning may entail a form of associative learning resulting from the repetitive pairing of the reinforcement feedback and reward during training, a mechanism that may be exploited in future rehabilitation protocols.

Keywords: Behavioral neuroscience; Cognitive neuroscience; Neuroscience; Sensory neuroscience.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

None
Graphical abstract
Figure 1
Figure 1
The motor skill learning task (A) Time course of a trial in the motor skill learning task. Each trial started with the appearance of a sidebar and a target. After a variable preparatory phase (800-1000ms), a cursor appeared in the sidebar, playing the role of a “Go” signal. At this moment, participants were required to pinch the force transducer to bring the cursor into the target as quickly as possible and maintain it there until the end of the task (2000ms). Notably, on most trials, the cursor disappeared halfway toward the target (as displayed here). Then, a reinforcement feedback was provided in the form of a colored circle for 1000ms and provided binary knowledge of performance (Success or Failure in Block-SR and Block-SRR) or was non-informative (Block-S). The reinforcement feedback was determined based on the comparison between the Error on the trial and the individual success threshold (computed in the Calibration block, see STAR Methods). Finally, each trial ended with a reminder of the color/feedback association and potential reward associated to good performance (1500ms). (B) Experimental procedure. On Day 1, all participants performed two familiarization blocks in a Block-SR condition. The first one involved full vision of the cursor while the second one provided only partial vision and served to calibrate the difficulty of the task on an individual basis (See STAR Methods). Then, Pre- and Post-training Block-SR assessments were separated by 6 blocks of training in the condition corresponding to each individual group (Block-S for Group-S, Block-SR for Group-SR and Block-SRR for Group-SRR). Day 2 involved a Familiarization block (with partial vision) followed by a Re-test assessment (4 Block-SR pooled together). There was no recalibration on Day 2. (C) Example of a force profile. Force applied (in % of MVC) during the task. Participants were asked to approximate the TargetForce as quickly and accurately as possible to minimize the Error (gray shaded area). As shown on the Figure, this Error depended on the speed of force initiation (ForceInitiation) and on the accuracy of the maintained force, as reflected by its amplitude with respect to the TargetForce (ForceAmplError) and its variability (ForceVariability). Note that the first 150ms of each trial were not considered for the computation of the Error.
Figure 2
Figure 2
Effect of reward on motor skill learning (A) Error. Average Error is represented across practice for the three experimental groups (gray: Group-S, light green: Group-SR, dark green: Group-SRR). The gray shaded area highlights the blocks concerned by the reinforcement manipulation. The remaining blocks were performed with knowledge of performance only (i.e., in a Block-SR setting). (B) Skill learning. Bar plot (left) and violin plot (right, each dot = one subject) representing skill learning (quantified as the Error in Post-training blocks expressed in percentage of Pre-training blocks) in the three experimental groups. Skill learning was significantly enhanced in Group-SRR compared to the two other groups. This result remained significant when removing the subject showing an extreme value in the Group-SR (ANOVA: F(2,86) = 6.44, p = 0.0025, partial η2 = 0.13; post-hocs; Group-SRR vs. Group-SR: p = 0.027; Group-SRR vs. Group-S: p = 0.00064; Group-SR vs. Group-S: p = 0.21). (C) Skill maintenance. Bar plot (left) and violin plot (right) representing skill maintenance quantified as the Error in Re-test blocks expressed in percentage of Pre-training blocks) in the three experimental groups. (D) Success. Proportion of successful trials for each block. (E) Force profiles. Individual force profiles of one representative subject of Group-S (left), Group-SR (middle) and Group-SRR (right) in the Pre- (gray) and Post-training blocks (blue). Note the better approximation of the TargetForce and the reduced inter-trial variability at Post-training in the exemplar subject of Group-SRR. ∗: significant difference between groups (p<0.05). #: significant difference within a group between normalized Post-training Error and a constant value of 100% (p<0.017 to account for multiple comparisons). Data are represented as mean ± SE
Figure 3
Figure 3
Between-trial adjustments in the Error (A) Reinforcement-based adjustments in the Error during Day 1 training. Absolute between-trial adjustments in the Error (ErrorBTC = |Errorn+1-Errorn|) according to the reinforcement feedback (i.e., Success or Failure) encountered at trialn in the three GroupTYPES (gray: Group-S, light green: Group-SR, dark green: Group-SRR). Notably, these bins of trials were constituted based on the success threshold-normalized Error at trialn in order to compare adjustments in motor commands following trials of similar Error in the three groups. Stars denote significant group differences in ErrorBTC for a given outcome (left panel, see STAR Methods). Reinforcement-based adjustments (ErrorBTC after Failure in percentage of ErrorBTC after Success) were compared in the three GroupTYPES (right panel). (B) Correlations between the magnitude of reinforcement-based adjustments in the Error and the average success rate on the next trial, showing the relevance of these adjustments in the present task. Each dot represents a subject. (C, D) Same for Day 2 training. Note that reinforcement-based adjustments in motor commands remained amplified in GroupSRR, despite the absence of reward on Day 2. (E) Sensory-based adjustments in the Error during Day 1 training. ErrorBTC following trialsn with Failures of different Error magnitudes (left panel). Sensory-based adjustments (ErrorBTC after Large Failure in percentage of ErrorBTC after Small Failure) were compared in the three GroupTYPES (right panel). (F) Correlations between the magnitude of sensory-based adjustments in the Error and the probability of success on the next trial, showing the relevance of these adjustments for task success. (G, H) Same for Day 2 training. ∗: p < 0.05. Data are represented as mean ± SE.
Figure 4
Figure 4
Between-trial adjustments in initiation time, amplitude error and variability Reinforcement-based adjustments in the ForceInitiation (A), ForceAmplError (B) and ForceVariability (C). Absolute between-trial changes (BTC) for each motor component (ForceBTC = |Forcen+1-Forcen|) according to the reinforcement feedback (i.e., Success or Failure) encountered at trialn in the three GroupTYPES (gray: Group-S, light green: Group-SR, dark green: Group-SRR). Notably, these bins of trials were constituted based on the success threshold-normalized Error at trialn. Stars denote significant group differences in ErrorBTC for a given outcome (left panel). Reinforcement-based adjustments (ForceBTC after Failure in percentage of ForceBTC after Success) in the three GroupTYPES (right panel). Sensory-based adjustments in the ForceInitiation (D), ForceAmplError (E) and ForceVariability (F). ForceBTC following trialsn with Failures of different Error magnitudes (left panel). Sensory-based adjustments (ForceBTC after Large Failure in percentage of ForceBTC after Small Failure) in the three GroupTYPES (right panel). ∗: p < 0.05. Data are represented as mean ± SE.

References

    1. Abe M., Schambra H., Wassermann E.M., Luckenbaugh D., Schweighofer N., Cohen L.G. Reward improves long-term retention of a motor memory through induction of offline memory gains. Curr. Biol. 2011;21:557–562. doi: 10.1016/j.cub.2011.02.030. - DOI - PMC - PubMed
    1. Avraham G., Taylor J.A., Ivry R.B., McDougle S.D. An associative learning account of sensorimotor adaptation. bioRxiv. 2020 doi: 10.1101/2020.09.14.297143. - DOI - PMC - PubMed
    1. Balleine B.W., O’Doherty J.P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology. 2010;35:48–69. doi: 10.1038/npp.2009.131. - DOI - PMC - PubMed
    1. Barron A.B., Søvik E., Cornish J.L. The roles of dopamine and related compounds in reward-seeking behavior across animal phyla. Front. Behav.Neurosci. 2010;4:1–9. doi: 10.3389/fnbeh.2010.00163. - DOI - PMC - PubMed
    1. Berke J.D. What does dopamine mean? Nat. Neurosci. 2018;21:787–793. doi: 10.1038/s41593-018-0152-y. - DOI - PMC - PubMed

LinkOut - more resources