Title
Robot navigation in a crowd by integrating deep reinforcement learning and online planning
Abstract
Navigating mobile robots along time-efficient and collision-free paths in crowds is still an open and challenging problem. The key is to build a profound understanding of the crowd for mobile robots, which is the basis of a proactive and foresighted policy. However, since the interaction mechanisms among pedestrians are complex and sophisticated, it is difficult to describe and model them accurately. For the excellent approximation capability of deep neural networks, deep reinforcement learning is a promising solution to the problem. However, current model-free learning-based approaches in crowd navigation always neglect planning and still lead to reactive collision avoidance policies and shortsighted behaviors. Meanwhile, most model-based learning-based approaches are based on state values, imposing a substantial computational burden. To address these problems, we propose a graph-based deep reinforcement learning method, social graph-based double dueling deep Q-network (SG-D3QN), that (i) introduces a social attention mechanism to extract an efficient graph representation for the crowd-robot state, (ii) extends the previous state value approximator to a state-action value approximator, (iii) further optimizes the state-action value approximator with simulated experiences generated by the learned environment model, and (iv) then proposes a human-like decision-making process by integrating model-free reinforcement learning and online planning. Experimental results indicate that our approach helps the robot understand the crowd and achieves a high success rate of more than 0.99 in the crowd navigation task. Compared with previous state-of-the-art algorithms, our approach achieves better performance. Furthermore, with the human-like decision-making process, our approach incurs less than half of the computational cost.
Year
DOI
Venue
2022
10.1007/s10489-022-03191-2
Applied Intelligence
Keywords
DocType
Volume
Crowd navigation, Mobile robot, Deep reinforcement learning, Online planning
Journal
52
Issue
ISSN
Citations 
13
0924-669X
0
PageRank 
References 
Authors
0.34
7
6
Name
Order
Citations
PageRank
Zhiqian Zhou100.34
Pengming Zhu200.34
Zhiwen Zeng3110.87
Junhao Xiao4185.60
Huimin Lu500.34
Zongtan Zhou641233.89