Title
Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning
Abstract
Cellular base stations (BS) and remote radio heads can be mounted on unmanned aerial vehicles (UAV) for flexible, traffic-aware deployment. These UAV base station networks (UAVBSN) promise an unprecendented degree of freedom that can be exploited for spectral efficiency gains as well as optimal network utilization. However, the current literature lacks realistic radio and traffic models for UAVBSN deployment planning and for performance evaluation. In this paper, we propose flow-level models (FLM) for realistically characterizing the UAVBSN performance in terms of a broad range of flow- and system-level metrics. Further, we propose a deep reinforcement learning (DRL) approach that relies on the UAVBSN FLM for learning the optimal traffic-aware UAV trajectories. For a given user traffic density and starting UAV locations, our RL approach learns the optimal UAV trajectories offline that maximizes a cumulative performance metric. We then execute the learned UAV trajectories in a discrete event simulator to evaluate online UAVBSN performance. For <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${M}\pmb {=}$ </tex-math></inline-formula> 9 UAVs deployed in a simulated Downtown San Francisco model, where the UAV trajectories are defined by <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">${N}\pmb {=}$ </tex-math></inline-formula> 20 discrete actions, our approach achieves approximately a three-fold increase in the average user throughput compared to the initial UAV placement, while simultaneously balancing traffic loads across the BSs.
Year
DOI
Venue
2019
10.1109/TCCN.2019.2948324
IEEE Transactions on Cognitive Communications and Networking
Keywords
Field
DocType
Unmanned aerial vehicles,Trajectory,Base stations,Reinforcement learning,Wireless communication,Throughput,Measurement
Base station,Computer science,Flow (psychology),Real-time computing,Reinforcement learning
Journal
Volume
Issue
ISSN
5
4
2332-7731
Citations 
PageRank 
References 
3
0.38
0
Authors
3
Name
Order
Citations
PageRank
Vidit Saxena1254.61
Joakim Jalden224321.59
Henrik Klessig330.38