Title
Active Deep Q-learning with Demonstration.
Abstract
Reinforcement learning (RL) is a machine learning technique that aims to learn how to take actions in an environment so as to maximize some notion of reward. Recent research has shown that although the learning efficiency of RL can be improved with expert demonstration, it usually takes considerable effort to obtain enough demonstrations, which prevents training decent RL agents with expert demonstration in practice. In this work, we propose Active Reinforcement Learning with Demonstration, a new framework that streamlines RL in terms of demonstration effort by allowing the RL agent to actively query for demonstration during training. Under this framework, we propose Active Deep Q-Network, a novel query strategy based on a classical RL algorithm called the deep Q-network (DQN). The proposed algorithm dynamically estimates the uncertainty of recent states and utilizes the queried demonstration data by optimizing a supervised loss in addition to the usual DQN loss. We propose two methods of estimating the uncertainty based on two state-of-the-art DQN variants, namely the divergence of bootstrapped DQN and the variance of noisy DQN. The empirical results validate that both methods not only learn faster than other passive expert-demonstration methods given the same amount of demonstration, but also reach a super-expert level of performance across four different tasks.
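The abstract's core idea can be illustrated with a short sketch: disagreement among bootstrapped Q-heads serves as the uncertainty signal that triggers a demonstration query, and demonstrated transitions contribute a supervised term on top of the usual TD loss. This is a minimal illustration, not the authors' implementation; the divergence measure, the DQfD-style large-margin supervised term, and all names, thresholds, and coefficients below are assumptions.

```python
# Minimal sketch (not the authors' code) of query-by-uncertainty with a
# combined TD + supervised loss. All names, thresholds, and the exact forms
# of the divergence and supervised terms are illustrative assumptions.
import numpy as np

def head_divergence(q_heads):
    """Disagreement among bootstrapped Q-heads for one state:
    mean KL divergence of each head's softmax policy from the mean policy."""
    z = q_heads - q_heads.max(axis=1, keepdims=True)  # numerical stability
    policies = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
    mean_policy = policies.mean(axis=0)
    return float((policies * np.log(policies / mean_policy)).sum(axis=1).mean())

def should_query(recent_q_heads, threshold=0.1):
    """Ask the expert for demonstration when the average disagreement
    over recently visited states exceeds a (hypothetical) threshold."""
    return np.mean([head_divergence(q) for q in recent_q_heads]) > threshold

def combined_loss(q_pred, action, td_target, demo_action=None,
                  margin=0.8, lam=1.0):
    """One-step TD loss plus, for demonstrated states, a large-margin
    supervised term that pushes the demonstrated action's Q-value above
    the others (in the spirit of DQfD; the exact form is an assumption)."""
    td = (q_pred[action] - td_target) ** 2
    sup = 0.0
    if demo_action is not None:
        margins = q_pred + margin * (np.arange(len(q_pred)) != demo_action)
        sup = margins.max() - q_pred[demo_action]
    return td + lam * sup

# Toy usage: 3 bootstrapped heads, 4 actions, over 2 recent states.
recent = [np.random.randn(3, 4) for _ in range(2)]
if should_query(recent):
    print("uncertain -> query the expert for demonstration")
print(combined_loss(np.array([0.2, 1.0, -0.3, 0.5]), action=1,
                    td_target=0.9, demo_action=3))
```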
Year
2018
DOI
10.1007/s10994-019-05849-4
Venue
MACHINE LEARNING
Keywords
Active learning, Reinforcement learning, Learning from demonstration
DocType
Journal
Volume
109
Issue
SP9-10
ISSN
0885-6125
Citations
2
PageRank
0.43
References
1
Authors
4
Name | Order | Citations | PageRank
Si-An Chen | 1 | 3 | 0.80
Voot Tangkaratt | 2 | 46 | 9.37
Hsuan-Tien Lin | 3 | 829 | 74.77
Masashi Sugiyama | 4 | 3353 | 264.24