Path Planning for Mobile Robots Based on TPR-DDPG - Citegraph

Paper Info

Title
Path Planning for Mobile Robots Based on TPR-DDPG

Abstract
Path planning is one of the key research topics in robotics. Researchers are paying increasing attention to reinforcement learning (RL) and deep learning (DL) because of RL's generality and self-learning ability and DL's strong learning ability. The deep deterministic policy gradient (DDPG) algorithm, which combines the architectures of deep Q-learning (DQN), deterministic policy gradient (DPG), and Actor-Critic (AC), differs from traditional RL methods and is suitable for continuous action spaces. Therefore, a TPR-DDPG based path planning algorithm for mobile robots is proposed. In the algorithm, the state is preprocessed by various normalization methods, and complete reward functions are designed so that agents reach the target point quickly along optimal paths in complex environments. A BatchNorm layer is added to the policy network, which ensures the stability of the algorithm. Finally, experimental results in which agents successfully reach the target points along the paths generated by the improved DDPG validate the effectiveness of the proposed algorithm.

Year: 2021
DOI: 10.1109/IJCNN52387.2021.9533570
Venue: 2021 International Joint Conference on Neural Networks (IJCNN)
Keywords: path planning, deep deterministic policy gradient (DDPG), policy network, value network, mobile robots
DocType: Conference
ISSN: 2161-4393
Citations: 0
PageRank: 0.34
References: 0
Authors: 5

Authors (5 rows)

Name	Order	Citations	PageRank
Yaping Zhao	1	0	0.34
Xiuqing Wang	2	23	3.98
Ruiyi Wang	3	0	0.34
Yunpeng Yang	4	0	0.34
Feng Lv	5	1	1.38
