Fast Adaptation to External Agents via Meta Imitation Counterfactual Regret Advantage. - Citegraph

Paper Info

Title
Fast Adaptation to External Agents via Meta Imitation Counterfactual Regret Advantage.

Abstract
This paper focuses on the multi-agent credit assignment problem. We propose a novel multi-agent reinforcement learning algorithm called meta imitation counterfactual regret advantage (MICRA) and a three-phase framework for training, adaptation, and execution of MICRA. The key features are: (1) a counterfactual regret advantage is proposed to optimize the target agents' policy; (2) a meta-imitator is designed to infer the external agents' policies. Results show that MICRA outperforms state-of-the-art algorithms.

Year	DOI	Venue
2021	10.5555/3463952.3464209	AAMAS
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
mingyue zhang	1	13	10.00
Zhi Jin	2	1493	137.87
Yang Xu	3	16	5.33
Zehan Shen	4	0	0.34
Kun Liu	5	0	1.01
Keyu Pan	6	0	0.34

1