Mastering the game of Go with deep neural networks and tree search

David Silver¹, Aja Huang¹, Chris J Maddison¹, Arthur Guez¹, Laurent Sifre¹, George van den Driessche¹, Julian Schrittwieser¹, Ioannis Antonoglou¹, Veda Panneershelvam¹, Marc Lanctot¹, Sander Dieleman¹, Dominik Grewe¹, John Nham², Nal Kalchbrenner¹, Ilya Sutskever², Timothy Lillicrap¹, Madeleine Leach¹, Koray Kavukcuoglu¹, Thore Graepel¹, Demis Hassabis¹

Affiliations

¹ Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
² Google, 1600 Amphitheatre Parkway, Mountain View, California 94043, USA.

PMID: 26819042
DOI: 10.1038/nature16961

Mastering the game of Go with deep neural networks and tree search

David Silver et al. Nature. 2016.

. 2016 Jan 28;529(7587):484-9.

doi: 10.1038/nature16961.

Authors

Affiliations

¹ Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
² Google, 1600 Amphitheatre Parkway, Mountain View, California 94043, USA.

PMID: 26819042
DOI: 10.1038/nature16961

Abstract

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

PubMed Disclaimer

Comment in

Ready or Not, Here We Go: Decision-Making Strategies From Artificial Intelligence Based on Deep Neural Networks.
Dyster T, Sheth SA, McKhann GM 2nd. Dyster T, et al. Neurosurgery. 2016 Jun;78(6):N11-2. doi: 10.1227/01.neu.0000484053.82181.f6. Neurosurgery. 2016. PMID: 27191806 No abstract available.
Train artificial intelligence to be fair to farming.
Lin YP, Petway JR, Settele J. Lin YP, et al. Nature. 2017 Dec 21;552(7685):334. doi: 10.1038/d41586-017-08881-3. Nature. 2017. PMID: 29293217 No abstract available.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
- Ovid Technologies, Inc.
Other Literature Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Mastering the game of Go with deep neural networks and tree search

Affiliations

Mastering the game of Go with deep neural networks and tree search

Authors

Affiliations

Abstract

Comment in

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources