Abstract
---
Sample efficiency and performance in the offline setting have emerged as significant challenges for deep reinforcement learning. We introduce Q-Value Weighted Regression (QWR), a simple RL algorithm that excels in these aspects. QWR is an extension of Advantage Weighted Regression (AWR), an off-policy actor-critic algorithm that performs very well on continuous control tasks, including in the offline setting, but has low sample efficiency and struggles with high-dimensional observation spaces. We perform an analysis of AWR that explains its shortcomings and use these insights to motivate QWR. We show experimentally that QWR matches state-of-the-art algorithms on tasks with both continuous and discrete actions. In particular, QWR yields results on par with SAC on the MuJoCo suite and, with the same set of hyperparameters, results on par with a highly tuned Rainbow implementation on a set of Atari games. We also verify that QWR performs well in the offline RL setting.
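The abstract names the algorithms but not their update rules. As background only, a sketch of the policy improvement step of AWR (from the AWR paper by Peng et al., 2019, not from this abstract) is given below: the policy is fit by supervised regression toward replayed actions, weighted by the exponentiated advantage. The substitution of a Q-function attributed to QWR is an inference from its name, not a detail stated in the abstract.

```latex
% AWR policy improvement (Peng et al., 2019): weighted regression
% toward actions in the replay buffer D.
%   R_{s,a}  : return observed after taking action a in state s
%   V(s)     : learned state-value baseline
%   \beta    : temperature hyperparameter
% QWR presumably replaces the advantage R_{s,a} - V(s) with a weight
% derived from a learned Q-function (assumption; see the paper).
\pi_{k+1} = \arg\max_{\pi} \;
  \mathbb{E}_{(s,a) \sim \mathcal{D}}
  \left[ \log \pi(a \mid s) \,
         \exp\!\left( \tfrac{1}{\beta}
           \bigl( \mathcal{R}_{s,a} - V(s) \bigr) \right) \right]
```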
Year | DOI | Venue |
---|---|---|
2022 | 10.1109/IJCNN55064.2022.9892633 | IEEE International Joint Conference on Neural Networks (IJCNN)
DocType | Citations | PageRank
---|---|---
Conference | 0 | 0.34
References | Authors
---|---
0 | 5
Name | Order | Citations | PageRank |
---|---|---|---|
Piotr Kozakowski | 1 | 0 | 1.01 |
Łukasz Kaiser | 2 | 2307 | 89.08 |
Henryk Michalewski | 3 | 0 | 0.68 |
Afroz Mohiuddin | 4 | 0 | 2.03 |
Katarzyna Kańska | 5 | 0 | 0.34 |