Combating False Negatives in Adversarial Imitation Learning - Citegraph

Paper Info

Title
Combating False Negatives in Adversarial Imitation Learning

Abstract
In adversarial imitation learning, a discriminator is trained to differentiate agent episodes from expert demonstrations representing the desired behavior. However, as the trained policy learns to be more successful, the negative examples (the ones produced by the agent) become increasingly similar to expert ones. Despite the fact that the task is successfully accomplished in some of the agent's trajectories, the discriminator is trained to output low values for them. We hypothesize that this inconsistent training signal for the discriminator can impede its learning, and consequently leads to worse overall performance of the agent. We show experimental evidence for this hypothesis and that the 'False Negatives' (i.e. successful agent episodes) significantly hinder adversarial imitation learning, which is the first contribution of this paper. Then, we propose a method to alleviate the impact of false negatives and test it on the BabyAI environment. This method consistently improves sample efficiency over the baselines by at least an order of magnitude.

Year	DOI	Venue
2021	10.1109/IJCNN52387.2021.9534032	2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)
DocType	Volume	ISSN
Conference	34	2161-4393
Citations	PageRank	References
0	0.34	0
Authors
7

Authors (7 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Żołna Konrad	1	7	4.51
Chitwan Saharia	2	3	2.74
Boussioux Leonard	3	0	0.68
Hui David Yu-Tung	4	0	0.68
Maxime Chevalier-Boisvert	5	3	2.40
Dzmitry Bahdanau	6	2677	117.03
Yoshua Bengio	7	42677	3039.83

1