Title
Neural Networks with Few Multiplications
Abstract
For most deep learning algorithms, training is notoriously time-consuming. Since most of the computation in training neural networks is typically spent on floating-point multiplications, we investigate an approach to training that eliminates the need for most of these. Our method consists of two parts: first, we stochastically binarize weights to convert the multiplications involved in computing hidden states into sign changes; second, while back-propagating error derivatives, in addition to binarizing the weights, we quantize the representations at each layer to convert the remaining multiplications into binary shifts. Experimental results on three popular datasets (MNIST, CIFAR10, SVHN) show that this approach not only does not hurt classification performance but can even yield better performance than standard stochastic gradient descent training, paving the way to fast, hardware-friendly training of neural networks.
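To make the two ideas in the abstract concrete, here is a minimal NumPy sketch of (a) stochastic binarization of weights for the forward pass and (b) rounding representations to powers of two so that the remaining multiplications in back-propagation become binary shifts. The function names, the hard-sigmoid binarization probability, and the exponent range are illustrative assumptions, not the paper's exact scheme.

```python
import numpy as np

def stochastic_binarize(W, rng):
    """Map each weight to +1 with probability p = clip((w + 1) / 2, 0, 1)
    (a hard sigmoid) and to -1 otherwise, so multiplying an input by the
    binarized weight matrix needs only sign changes and additions."""
    p = np.clip((W + 1.0) / 2.0, 0.0, 1.0)
    return np.where(rng.uniform(size=W.shape) < p, 1.0, -1.0)

def quantize_to_power_of_two(x, min_exp=-8, max_exp=8):
    """Round each entry's magnitude to the nearest power of two (keeping its
    sign), so multiplying a gradient by it reduces to a binary shift.
    The exponent range [min_exp, max_exp] is an assumed hyperparameter."""
    sign = np.sign(x)
    mag = np.abs(x)
    exp = np.clip(np.round(np.log2(np.maximum(mag, 2.0 ** min_exp))), min_exp, max_exp)
    return sign * np.exp2(exp)

# Usage: one fully connected layer with a binarized forward pass.
rng = np.random.default_rng(0)
W = rng.uniform(-1.0, 1.0, size=(784, 256))   # real-valued weights kept for updates
x = rng.uniform(0.0, 1.0, size=(1, 784))      # one input vector
Wb = stochastic_binarize(W, rng)              # {-1, +1} weights used in the forward pass
h = np.maximum(0.0, x @ Wb)                   # hidden state via sign changes + additions
hq = quantize_to_power_of_two(h)              # shift-friendly version for back-propagation
```

In this sketch the real-valued weights `W` are still stored and updated; only the forward and backward computations use the binarized and power-of-two-quantized copies.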
Year
2015
Venue
International Conference on Learning Representations
Field
Stochastic gradient descent, MNIST database, Computer science, Floating point, Artificial intelligence, Deep learning, Artificial neural network, Machine learning, Binary number, Computation
DocType
Journal
Volume
abs/1510.03009
Citations
0
PageRank
0.34
References
0
Authors
4
Name                   Order  Citations  PageRank
Zhouhan Lin            1      419        17.51
Matthieu Courbariaux   2      565        22.77
Roland Memisevic       3      1116       65.87
Yoshua Bengio          4      42677      3039.83