Melioration Learning in Two-Person Games
- PMID: 27851815
- PMCID: PMC5112854
- DOI: 10.1371/journal.pone.0166708
Melioration Learning in Two-Person Games
Abstract
Melioration learning is an empirically well-grounded model of reinforcement learning. By means of computer simulations, this paper derives predictions for several repeatedly played two-person games from this model. The results indicate a likely convergence to a pure Nash equilibrium of the game. If no pure equilibrium exists, the relative frequencies of choice may approach the predictions of the mixed Nash equilibrium. Yet in some games, no stable state is reached.
Conflict of interest statement
The author has declared that no competing interests exist.
Figures














Similar articles
-
Spike-based decision learning of Nash equilibria in two-player games.PLoS Comput Biol. 2012;8(9):e1002691. doi: 10.1371/journal.pcbi.1002691. Epub 2012 Sep 27. PLoS Comput Biol. 2012. PMID: 23028289 Free PMC article.
-
Dynamical selection of Nash equilibria using reinforcement learning: Emergence of heterogeneous mixed equilibria.PLoS One. 2018 Jul 9;13(7):e0196577. doi: 10.1371/journal.pone.0196577. eCollection 2018. PLoS One. 2018. PMID: 29985923 Free PMC article.
-
Operant matching as a Nash equilibrium of an intertemporal game.Neural Comput. 2009 Oct;21(10):2755-73. doi: 10.1162/neco.2009.09-08-854. Neural Comput. 2009. PMID: 19635021
-
Mechanisms of reinforcement learning and decision making in the primate dorsolateral prefrontal cortex.Ann N Y Acad Sci. 2007 May;1104:108-22. doi: 10.1196/annals.1390.007. Epub 2007 Mar 8. Ann N Y Acad Sci. 2007. PMID: 17347332 Review.
-
[Mathematical models of decision making and learning].Brain Nerve. 2008 Jul;60(7):791-8. Brain Nerve. 2008. PMID: 18646619 Review. Japanese.
Cited by
-
Multi-agent reinforcement learning with approximate model learning for competitive games.PLoS One. 2019 Sep 11;14(9):e0222215. doi: 10.1371/journal.pone.0222215. eCollection 2019. PLoS One. 2019. PMID: 31509568 Free PMC article.
References
-
- Young HP. Strategic Learning and its Limits. New York: Oxford University Press; 2004. 10.1093/acprof:oso/9780199269181.001.0001 - DOI
-
- Hart S, Mas-Colell A. Uncoupled dynamics do not lead to Nash equilibrium. The American Economic Review. 2003;93(5):1830–1836. 10.1257/000282803322655581 - DOI
-
- Foster DP, Young HP. Regret testing: Learning to play Nash equilibrium without knowing you have an opponent. Theoretical Economics. 2006;1(3):341–367.
-
- Babichenko Y. Completely uncoupled dynamics and Nash equilibria. Games and Economic Behavior. 2012;76(1):1–14. 10.1016/j.geb.2012.06.004 - DOI
-
- Germano F, Lugosi G. Global Nash convergence of Foster and Young’s regret testing. Games and Economic Behavior. 2007;60(1):135–154.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources