Title
Differentiable Quantization of Deep Neural Networks
Abstract
We propose differentiable quantization (DQ) for efficient deep neural network (DNN) inference, where gradient descent is used to learn the quantizer's step size, dynamic range and bitwidth. Training with differentiable quantizers brings two main benefits: first, DQ does not introduce hyperparameters; second, we can learn a different step size, dynamic range and bitwidth for each layer. Our experiments show that DNNs with heterogeneous, learned bitwidths yield better performance than DNNs with a homogeneous bitwidth. Further, we show that one natural DQ parametrization is especially well suited for training. We confirm our findings with experiments on CIFAR-10 and ImageNet, obtaining quantized DNNs with learned quantization parameters that achieve state-of-the-art performance.
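The abstract describes the method only at a high level. As a rough illustration, below is a minimal sketch of a quantizer whose step size and dynamic range are learned by gradient descent, assuming a uniform quantizer and a straight-through estimator for the non-differentiable rounding. The class name DiffQuantizer, the log parametrization and the STE choice are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn

class DiffQuantizer(nn.Module):
    """Uniform quantizer with learnable step size and dynamic range (sketch)."""

    def __init__(self, init_step: float = 0.05, init_max: float = 1.0):
        super().__init__()
        # Log parametrization keeps both quantities strictly positive
        # under unconstrained gradient updates (an illustrative choice).
        self.log_step = nn.Parameter(torch.tensor(init_step).log())
        self.log_max = nn.Parameter(torch.tensor(init_max).log())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        d = self.log_step.exp()      # quantization step size
        qmax = self.log_max.exp()    # dynamic range (clipping level)
        x = torch.minimum(torch.maximum(x, -qmax), qmax)
        # Straight-through estimator: the forward pass rounds, but the
        # detach() hides the rounding error from autograd, so gradients
        # flow to x, d and qmax as if rounding were the identity.
        return x + (torch.round(x / d) * d - x).detach()

# Usage: gradients reach the weights and both quantizer parameters.
q = DiffQuantizer()
w = torch.randn(4, 4, requires_grad=True)
q(w).pow(2).sum().backward()

Under this parametrization the effective bitwidth is implied by the ratio of dynamic range to step size, roughly log2(2*qmax/d + 1), so learning the (step size, range) pair per layer yields the heterogeneous bitwidths the abstract refers to; which parametrization trains best is exactly the question the paper studies.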
Year
2019
Venue
arXiv: Learning
DocType
Journal
Volume
abs/1905.11452
Citations
1
PageRank
0.35
References
0
Authors
8
Name                  Order  Citations  PageRank
Stefan Uhlich         1      35         7.62
Lukas Mauch           2      13         4.97
Kazuki Yoshiyama      3      4          1.46
Fabien Cardinaux      4      279        19.00
Javier Alonso García  5      4          1.46
Stephen Tiedemann     6      4          1.46
Thomas Kemp           7      246        30.93
Akio Nakamura         8      62         14.45