LORM: Learning to Optimize for Resource Management in Wireless Networks With Few Training Samples - Citegraph

Paper Info

Title
LORM: Learning to Optimize for Resource Management in Wireless Networks With Few Training Samples

Abstract
Effective resource management plays a pivotal role in wireless networks, which, unfortunately, typically results in challenging mixed-integer nonlinear programming (MINLP) problems. Machine learning-based methods have recently emerged as a disruptive way to obtain near-optimal performance for MINLPs with affordable computational complexity. There have been some attempts in applying such methods to resource management in wireless networks, but these attempts require huge amounts of training samples and lack the capability to handle constrained problems. Furthermore, they suffer from severe performance deterioration when the network parameters change, which commonly happens and is referred to as the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">task mismatch</italic> problem. In this paper, to reduce the sample complexity and address the feasibility issue, we propose a framework of Learning to Optimize for Resource Management (LORM). In contrast to the end-to-end learning approach adopted in previous studies, LORM learns the optimal pruning policy in the branch-and-bound algorithm for MINLPs via a sample-efficient method, namely, <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">imitation learning</italic> . To further address the task mismatch problem, we develop a transfer learning method via self-imitation in LORM, named <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">LORM-TL</italic> , which can quickly adapt a pre-trained machine learning model to the new task with only a few additional <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">unlabeled</italic> training samples. Numerical simulations demonstrate that LORM outperforms specialized state-of-the-art algorithms and achieves near-optimal performance, while providing significant speedup compared with the branch-and-bound algorithm. Moreover, LORM-TL, by relying on a few unlabeled samples, achieves comparable performance with the model trained from scratch with sufficient labeled samples.

Year	DOI	Venue
2020	10.1109/TWC.2019.2947591	IEEE Transactions on Wireless Communications
Keywords	Field	DocType
Resource management,Optimization,Machine learning algorithms,Wireless networks,Training,Machine learning,Heuristic algorithms	Resource management,Wireless network,Computer network,Mathematics	Journal
Volume	Issue	ISSN
19	1	1536-1276
Citations	PageRank	References
9	0.48	0
Authors
4

Authors (4 rows)

Cited by (9 rows)

References (0 rows)

Name	Order	Citations	PageRank
Yifei Shen	1	9	0.48
Yuanming Shi	2	659	53.58
Jun Zhang	3	3772	190.36
K. B. Letaief	4	11078	879.10

1