Title
Decoding Polar Codes with Reinforcement Learning.
Abstract
In this paper we address the problem of selecting factor-graph permutations of polar codes under belief propagation (BP) decoding to significantly improve the error-correction performance of the code. In particular, we formalize the factor-graph permutation selection as the multi-armed bandit problem in reinforcement learning and propose a decoder that acts like an online-learning agent that learns to select the good factor-graph permutations during the course of decoding. We use state-of-the-art algorithms for the multi-armed bandit problem and show that for a 5G polar codes of length 128 with 64 information bits, the proposed decoder has an error-correction performance gain of around 0.125 dB at the target frame error rate of 10−4, when compared to the approach that randomly selects the factor-graph permutations.
Year
DOI
Venue
2020
10.1109/GLOBECOM42002.2020.9348007
GLOBECOM
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Nghia Doan154.21
Seyyed Ali Hashemi200.34
Warren J. Gross31106113.38