Robot navigation in a crowd by integrating deep reinforcement learning and online planning - Citegraph

Paper Info

Title
Robot navigation in a crowd by integrating deep reinforcement learning and online planning

Abstract
Navigating mobile robots along time-efficient and collision-free paths in crowds is still an open and challenging problem. The key is to build a profound understanding of the crowd for mobile robots, which is the basis of a proactive and foresighted policy. However, since the interaction mechanisms among pedestrians are complex and sophisticated, it is difficult to describe and model them accurately. For the excellent approximation capability of deep neural networks, deep reinforcement learning is a promising solution to the problem. However, current model-free learning-based approaches in crowd navigation always neglect planning and still lead to reactive collision avoidance policies and shortsighted behaviors. Meanwhile, most model-based learning-based approaches are based on state values, imposing a substantial computational burden. To address these problems, we propose a graph-based deep reinforcement learning method, social graph-based double dueling deep Q-network (SG-D3QN), that (i) introduces a social attention mechanism to extract an efficient graph representation for the crowd-robot state, (ii) extends the previous state value approximator to a state-action value approximator, (iii) further optimizes the state-action value approximator with simulated experiences generated by the learned environment model, and (iv) then proposes a human-like decision-making process by integrating model-free reinforcement learning and online planning. Experimental results indicate that our approach helps the robot understand the crowd and achieves a high success rate of more than 0.99 in the crowd navigation task. Compared with previous state-of-the-art algorithms, our approach achieves better performance. Furthermore, with the human-like decision-making process, our approach incurs less than half of the computational cost.

Year	DOI	Venue
2022	10.1007/s10489-022-03191-2	Applied Intelligence
Keywords	DocType	Volume
Crowd navigation, Mobile robot, Deep reinforcement learning, Online planning	Journal	52
Issue	ISSN	Citations
13	0924-669X	0
PageRank	References	Authors
0.34	7	6

Authors (6 rows)

Cited by (0 rows)

References (7 rows)

Name	Order	Citations	PageRank
Zhiqian Zhou	1	0	0.34
Pengming Zhu	2	0	0.34
Zhiwen Zeng	3	11	0.87
Junhao Xiao	4	18	5.60
Huimin Lu	5	0	0.34
Zongtan Zhou	6	412	33.89

1