APER-DDQN: UAV Precise Airdrop Method Based on Deep Reinforcement Learning - Citegraph

Paper Info

Title
APER-DDQN: UAV Precise Airdrop Method Based on Deep Reinforcement Learning

Abstract
Accuracy is the most critical factor that affects the effect of unmanned aerial vehicle (UAV) airdrop. The method to improve the accuracy of UAV airdrop based on traditional modeling has some limitations such as complex modeling, multiple model parameters and difficulty in considering all kinds of factors comprehensively when facing complex realistic environment. In order to solve the problem of UAV precision airdrop more conveniently, this paper introduces the deep reinforcement learning method and proposes an Adaptive Priority Experience Replay Deep Double Q-Network (APER-DDQN) algorithm based on Deep Double Q-Network (DDQN). This method introduces the priority experience replay mechanism based on DDQN, and adopts adaptive discount rate and learning rate to improve the decision-making performance and stability of the algorithm. Furthermore, this paper designs and builds a simulation experimental platform for algorithm training and testing. The experimental results show that our APER-DDQN has good performance and can more effectively solve the problem of UAV accurate airdrop while avoiding the complex modeling process. Firstly, in the training stage, compared with DDQN and Deep Q Network (DQN), APER-DDQN has faster convergence speed, higher reward and more stable performance. Then, in the test phase, compared with relying on human experience, our method shows higher average reward (average 3.01) and success rate (average 41%), and our method also has more advantages in performance compared with DDQN and DQN. Finally, extended experiments verify the generalization ability of APER-DDQN to different environments.

Year	DOI	Venue
2022	10.1109/ACCESS.2022.3174105	IEEE ACCESS
Keywords	DocType	Volume
Atmospheric modeling, Reinforcement learning, Autonomous aerial vehicles, Training, Decision making, Games, Costs, UAV airdrop, deep reinforcement learning, double deep Q-network, priority experience replay	Journal	10
ISSN	Citations	PageRank
2169-3536	0	0.34
References	Authors
0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yan Ouyang	1	0	0.68
Xinqing Wang	2	0	0.68
Ruizhe Hu	3	0	0.68
Honghui Xu	4	0	1.01

1