Title
Mastering the game of Go with deep neural networks and tree search.
Abstract
The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.
Year
DOI
Venue
2016
10.1038/nature16961
NATURE
Keywords
Field
DocType
Computer science,Computational science,Reward
Combinatorial game theory,Monte Carlo tree search,Search algorithm,Game mechanics,Computer science,Computer Go,Artificial intelligence,General video game playing,Late Move Reductions,Reinforcement learning
Journal
Volume
Issue
ISSN
529
7587
0028-0836
Citations 
PageRank 
References 
1643
63.63
32
Authors
20
Search Limit
1001000
Name
Order
Citations
PageRank
David Silver18252363.86
Aja Huang2224688.44
Maddison, Chris J.3179175.44
Arthur Guez42481100.43
Laurent Sifre5247094.03
George van den Driessche6224584.67
Julian Schrittwieser7220582.69
Ioannis Antonoglou82977114.70
Veda Panneershelvam9164363.63
Marc Lanctot10212197.97
Sander Dieleman112607102.93
Dominik Grewe12164363.63
John Nham13164363.63
Nal Kalchbrenner143662149.32
Ilya Sutskever15258141120.24
Timothy P. Lillicrap164377170.65
Madeleine Leach17164363.63
Koray Kavukcuoglu1810189504.11
Thore Graepel194211242.71
Demis Hassabis204924191.12