Title
Fast Adaptation to External Agents via Meta Imitation Counterfactual Regret Advantage.
Abstract
This paper focuses on the multi-agent credit assignment problem. We propose a novel multi-agent reinforcement learning algorithm called meta imitation counterfactual regret advantage (MICRA) and a three-phase framework for training, adaptation, and execution of MICRA. The key features are: (1) a counterfactual regret advantage is proposed to optimize the target agents' policy; (2) a meta-imitator is designed to infer the external agents' policies. Results show that MICRA outperforms state-of-the-art algorithms.
Year
DOI
Venue
2021
10.5555/3463952.3464209
AAMAS
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
mingyue zhang11310.00
Zhi Jin21493137.87
Yang Xu3165.33
Zehan Shen400.34
Kun Liu501.01
Keyu Pan600.34