Title
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning
Abstract
Recent years have witnessed the great breakthrough of deep reinforcement learning (DRL) in various perfect and imperfect information games. Among these games, DouDizhu, a popular card game in China, is very challenging due to the imperfect information, large state and action space as well as elements of collaboration. Recently, a DouDizhu AI system called DouZero has been proposed. Trained using traditional Monte Carlo method with deep neural networks and self-play procedure without the abstraction of human prior knowledge, DouZero has achieved the best performance among all the existing DouDizhu AI programs. In this work, we propose to enhance DouZero by introducing opponent modeling into DouZero. Besides, we propose a novel coach network to further boost the performance of DouZero and accelerate its training process. With the integration of the above two techniques into DouZero, our DouDizhu AI system achieves better performance and ranks top in the Botzone leaderboard among more than 400 AI agents, including DouZero.
Year
DOI
Venue
2022
10.1109/CoG51982.2022.9893710
2022 IEEE Conference on Games (CoG)
Keywords
DocType
ISSN
DouDizhu,Reinforcement learning,Monte-Carl Method,Opponent Modeling,Coach Network
Conference
2325-4270
ISBN
Citations 
PageRank 
978-1-6654-5990-7
0
0.34
References 
Authors
13
5
Name
Order
Citations
PageRank
Youpeng Zhao100.68
Jian Zhao201.01
Xunhan Hu300.68
Wengang Zhou4122679.31
Houqiang Li52090172.30