Distributed Power Control for Large Energy Harvesting Networks: A Multi-Agent Deep Reinforcement Learning Approach - Citegraph

Paper Info

Title
Distributed Power Control for Large Energy Harvesting Networks: A Multi-Agent Deep Reinforcement Learning Approach

Abstract
In this paper, we develop a multi-agent reinforcement learning (MARL) framework to obtain online power control policies for a <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">large</italic> energy harvesting (EH) multiple access channel, when only causal information about the EH process and wireless channel is available. In the proposed framework, we model the online power control problem as a discrete-time mean-field game (MFG), and analytically show that the MFG has a <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">unique</italic> stationary solution. Next, we leverage the fictitious play property of the mean-field games, and the deep reinforcement learning technique to learn the stationary solution of the game, in a completely <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">distributed</italic> fashion. We analytically show that the proposed procedure converges to the unique stationary solution of the MFG. This, in turn, ensures that the optimal policies can be learned in a completely distributed fashion. In order to benchmark the performance of the distributed policies, we also develop a deep neural network (DNN) based centralized as well as distributed online power control schemes. Our simulation results show the efficacy of the proposed power control policies. In particular, the DNN based centralized power control policies provide a very good performance for large EH networks for which the design of optimal policies is intractable using the conventional methods such as Markov decision processes. Further, performance of both the distributed policies is close to the throughput achieved by the centralized policies.

Year	DOI	Venue
2019	10.1109/TCCN.2019.2949589	IEEE Transactions on Cognitive Communications and Networking
Keywords	DocType	Volume
Power control,Batteries,Transmitters,Fading channels,Reinforcement learning,Throughput,Wireless communication	Journal	5
Issue	ISSN	Citations
4	2332-7731	2
PageRank	References	Authors
0.39	0	5

Authors (5 rows)

Cited by (2 rows)

References (0 rows)

Name	Order	Citations	PageRank
Mohit K. Sharma	1	6	2.16
Alessio Zappone	2	1641	69.84
Mohamad Assaad	3	346	48.14
Mérouane Debbah	4	2	0.39
Spyridon Vassilaras	5	105	9.46

1