Dynamic response-by-response models of matching behavior in rhesus monkeys
- PMID: 16596980
- PMCID: PMC1389781
- DOI: 10.1901/jeab.2005.110-04
Dynamic response-by-response models of matching behavior in rhesus monkeys
Abstract
We studied the choice behavior of 2 monkeys in a discrete-trial task with reinforcement contingencies similar to those Herrnstein (1961) used when he described the matching law. In each session, the monkeys experienced blocks of discrete trials at different relative-reinforcer frequencies or magnitudes with unsignalled transitions between the blocks. Steady-state data following adjustment to each transition were well characterized by the generalized matching law; response ratios undermatched reinforcer frequency ratios but matched reinforcer magnitude ratios. We modelled response-by-response behavior with linear models that used past reinforcers as well as past choices to predict the monkeys' choices on each trial. We found that more recently obtained reinforcers more strongly influenced choice behavior. Perhaps surprisingly, we also found that the monkeys' actions were influenced by the pattern of their own past choices. It was necessary to incorporate both past reinforcers and past choices in order to accurately capture steady-state behavior as well as the fluctuations during block transitions and the response-by-response patterns of behavior. Our results suggest that simple reinforcement learning models must account for the effects of past choices to accurately characterize behavior in this task, and that models with these properties provide a conceptual tool for studying how both past reinforcers and past choices are integrated by the neural systems that generate behavior.
Figures











Similar articles
-
Linear-Nonlinear-Poisson models of primate choice dynamics.J Exp Anal Behav. 2005 Nov;84(3):581-617. doi: 10.1901/jeab.2005.23-05. J Exp Anal Behav. 2005. PMID: 16596981 Free PMC article.
-
The generalized matching law as a predictor of choice between cocaine and food in rhesus monkeys.Psychopharmacology (Berl). 2002 Oct;163(3-4):319-26. doi: 10.1007/s00213-002-1012-7. Epub 2002 Mar 1. Psychopharmacology (Berl). 2002. PMID: 12373433
-
Remembering as discrimination in delayed matching to sample: discriminability and bias.Learn Behav. 2007 Aug;35(3):177-83. doi: 10.3758/bf03193053. Learn Behav. 2007. PMID: 17918423
-
Choice behavior in transition: development of preference for the higher probability of reinforcement.J Exp Anal Behav. 1990 May;53(3):409-22. doi: 10.1901/jeab.1990.53-409. J Exp Anal Behav. 1990. PMID: 2341823 Free PMC article.
-
A model for discriminating reinforcers in time and space.Behav Processes. 2016 Jun;127:62-73. doi: 10.1016/j.beproc.2016.03.010. Epub 2016 Mar 22. Behav Processes. 2016. PMID: 27016156 Review.
Cited by
-
Dynamic estimation of task-relevant variance in movement under risk.J Neurosci. 2012 Sep 12;32(37):12702-11. doi: 10.1523/JNEUROSCI.6160-11.2012. J Neurosci. 2012. PMID: 22972994 Free PMC article.
-
Learning where to look for a hidden target.Proc Natl Acad Sci U S A. 2013 Jun 18;110 Suppl 2(Suppl 2):10438-45. doi: 10.1073/pnas.1301216110. Epub 2013 Jun 10. Proc Natl Acad Sci U S A. 2013. PMID: 23754404 Free PMC article.
-
Multiple brain networks contribute to the acquisition of bias in perceptual decision-making.Front Neurosci. 2015 Mar 5;9:63. doi: 10.3389/fnins.2015.00063. eCollection 2015. Front Neurosci. 2015. PMID: 25798082 Free PMC article.
-
Adaptive learning and decision-making under uncertainty by metaplastic synapses guided by a surprise detection system.Elife. 2016 Aug 9;5:e18073. doi: 10.7554/eLife.18073. Elife. 2016. PMID: 27504806 Free PMC article.
-
Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.Hum Brain Mapp. 2022 Oct 15;43(15):4750-4790. doi: 10.1002/hbm.25988. Epub 2022 Jul 21. Hum Brain Mapp. 2022. PMID: 35860954 Free PMC article.
References
-
- Akaike H. A new look at the statistical model identification. IEEE Transaction on Automatic Control. 1974;19:716–723.
-
- Anderson K.G, Velkey A.J, Woolverton W.L. The generalized matching law as a predictor of choice between cocaine and food in rhesus monkeys. Psychopharmacology. 2002;163:319–326. - PubMed
-
- Barraclough D.J, Conroy M.L, Lee D. Prefrontal cortex and decision making in a mixed-strategy game. Nature Neuroscience. 2004;7:404–410. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous