Acquisition of chess knowledge in AlphaZero
- PMID: 36375061
- PMCID: PMC9704706
- DOI: 10.1073/pnas.2206625119
Acquisition of chess knowledge in AlphaZero
Abstract
We analyze the knowledge acquired by AlphaZero, a neural network engine that learns chess solely by playing against itself yet becomes capable of outperforming human chess players. Although the system trains without access to human games or guidance, it appears to learn concepts analogous to those used by human chess players. We provide two lines of evidence. Linear probes applied to AlphaZero's internal state enable us to quantify when and where such concepts are represented in the network. We also describe a behavioral analysis of opening play, including qualitative commentary by a former world chess champion.
Keywords: artificial intelligence; deep learning; interpretability; machine learning; reinforcement learning.
Conflict of interest statement
The authors declare no competing interest.
Figures
References
-
- Turing A., “Digital computers applied to games” in Faster than Thought: A Symposium on Digital Computing Machines, Bowden B. V., Ed. (Pitman Publishing, London, United Kingdom, 1953), pp. 286–310.
-
- Campbell M., Hoane A. J. Jr., Hsu F., Deep blue. Artif. Intell. 134, 57–83 (2002).
-
- Silver D., et al. ., A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362, 1140–1144 (2018). - PubMed
-
- Olah C., et al. ., The building blocks of interpretability. Distill 3, e10 (2018).
MeSH terms
LinkOut - more resources
Full Text Sources
