MapGo - Model-Assisted Policy Optimization for Goal-Oriented Tasks. - Citegraph

Paper Info

Title
MapGo - Model-Assisted Policy Optimization for Goal-Oriented Tasks.

Abstract
In Goal-oriented Reinforcement learning, relabeling the raw goals in past experience to provide agents with hindsight ability is a major solution to the reward sparsity problem. In this paper, to enhance the diversity of relabeled goals, we develop FGI (Foresight Goal Inference), a new relabeling strategy that relabels the goals by looking into the future with a learned dynamics model. Besides, to improve sample efficiency, we propose to use the dynamics model to generate simulated trajectories for policy training. By integrating these two improvements, we introduce the MapGo framework (Model-Assisted Policy Optimization for Goal-oriented tasks). In our experiments, we first show the effectiveness of the FGI strategy compared with the hindsight one, and then show that the MapGo framework achieves higher sample efficiency when compared to model-free baselines on a set of complicated tasks.

Year	DOI	Venue
2021	10.24963/ijcai.2021/480	IJCAI
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	10

Authors (10 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Menghui Zhu	1	0	1.01
Minghuan Liu	2	0	0.68
Jian Shen	3	22	5.46
Zhicheng Zhang	4	1	1.09
Sheng Chen	5	0	1.35
Weinan Zhang	6	1228	97.24
Deheng Ye	7	105	8.89
Yong Yu	8	7637	380.66
Qiang Fu	9	1	4.42
Wei Yang	10	93	27.50

1