Theory of reinforcement schedules
- PMID: 37706228
- DOI: 10.1002/jeab.880
Abstract
The three principles of reinforcement are (1) events such as incentives and reinforcers increase the activity of an organism; (2) that activity is bounded by competition from other responses; and (3) animals approach incentives and their signs, guided by their temporal and physical conditions, together called the "contingencies of reinforcement." Mathematical models of each of these principles comprised mathematical principles of reinforcement (MPR; Killeen, 1994). Over the ensuing decades, MPR was extended to new experimental contexts. This article reviews the basic theory and its extensions to satiation, warm-up, extinction, sign tracking, pausing, and sequential control in progressive-ratio and multiple schedules. In the latter cases, a single equation balancing target and competing responses governs behavioral contrast and behavioral momentum. Momentum is intrinsic in the fundamental equations, as behavior unspools more slowly from highly aroused responses conditioned by higher rates of incitement than it does from responses from leaner contexts. Habits are responses that have accrued substantial behavioral momentum. Operant responses, being predictors of reinforcement, are approached by making them: The sight and feel of a paw on a lever is approached by placing paw on lever, as attempted for any sign of reinforcement. Behavior in concurrent schedules is governed by approach to momentarily richer patches (melioration). Applications of MPR in behavioral pharmacology and delay discounting are noted.
Keywords: MPR; actions; contrast; momentum; principles of reinforcement; reinforcement schedules.
© 2023 Society for the Experimental Analysis of Behavior.
