<sc>ReLeQ</sc> : A Reinforcement Learning Approach for Automatic Deep Quantization of Neural Networks - Citegraph

Paper Info

Title
<sc>ReLeQ</sc> : A Reinforcement Learning Approach for Automatic Deep Quantization of Neural Networks

Abstract
Deep Quantization (below eight bits) can significantly reduce the DNN computation and storage by decreasing the bitwidth of network encodings. However, without arduous manual effort, this deep quantization can lead to significant accuracy loss, leaving it in a position of questionable utility. We propose a systematic approach to tackle this problem, by automating the process of discovering the bitwidths through an end-to-end deep reinforcement learning framework (RELEQ). This framework utilizes the sample efficiency of proximal policy optimization to explore the exponentially large space of possible assignment of the bitwidths to the layers. We show how RELEQ can balance speed and quality, and provide a heterogeneous bitwidth assignment for quantization of a large variety of deep networks with minimal accuracy loss (≤ 0.3% loss) while minimizing the computation and storage costs. With these DNNs, RELEQ enables conventional hardware and custom DNN accelerator to achieve 2.2× speedup over 8-bit execution.

Year	DOI	Venue
2020	10.1109/MM.2020.3009475	IEEE Micro
Keywords	DocType	Volume
Neural networks,quantization,autoML	Journal	40
Issue	ISSN	Citations
5	0272-1732	0
PageRank	References	Authors
0.34	0	5

Authors (5 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Ahmed T. Elthakeb	1	0	2.37
Prannoy Pilligundla	2	0	2.37
FatemehSadat Mireshghallah	3	0	0.68
Amir Yazdanbakhsh	4	241	15.28
H. Esmaeilzadeh	5	1443	69.71

1