AlphaZero for a Non-Deterministic Game - Citegraph

Paper Info

Title
AlphaZero for a Non-Deterministic Game

Abstract
The AlphaZero algorithm, developed by DeepMind, achieved superhuman levels of play in the games of chess, shogi, and Go, by learning without domain-specific knowledge except game rules. This paper investigates whether the algorithm can also learn theoretical values and optimal plays for non-deterministic games. Since the theoretical values of such games are expected win rates, not a simple win, loss, or draw, it is worthy investigating the ability of the AlphaZero algorithm to approximate expected win rates of positions. This paper also studies how the algorithm is influenced by a set of hyper-parameters. The tested non-deterministic game is a reduced and solved version of Chinese dark chess (CDC), called 2×4 CDC. The experiments show that the AlphaZero algorithm converges nearly to the theoretical values and the optimal plays in many of the settings of the hyper-parameters. To our knowledge, this is the first research paper that applies the AlphaZero algorithm to non-deterministic games.

Year	DOI	Venue
2018	10.1109/TAAI.2018.00034	2018 Conference on Technologies and Applications of Artificial Intelligence (TAAI)
Keywords	Field	DocType
AlphaZero,non-deterministic game,Chinese dark chess,theoretical value	Approximation algorithm,Computer science,Theoretical computer science,Probability distribution,Prediction algorithms	Conference
ISSN	ISBN	Citations
2376-6816	978-1-7281-1230-5	0
PageRank	References	Authors
0.34	10	4

Authors (4 rows)

Cited by (0 rows)

References (10 rows)

Name	Order	Citations	PageRank
Chu-Hsuan Hsueh	1	11	4.21
I-Chen Wu	2	208	55.03
Jr-Chang Chen	3	42	15.19
Tsan-sheng Hsu	4	737	101.00

1