Title
Dynamic Bit-width Reconfiguration for Energy-Efficient Deep Learning Hardware
Abstract
Deep learning models have reached state of the art performance in many machine learning tasks. Benefits in terms of energy, bandwidth, latency, etc., can be obtained by evaluating these models directly within Internet of Things end nodes, rather than in the cloud. This calls for implementations of deep learning tasks that can run in resource limited environments with low energy footprints. Research and industry have recently investigated these aspects, coming up with specialized hardware accelerators for low power deep learning. One effective technique adopted in these devices consists in reducing the bit-width of calculations, exploiting the error resilience of deep learning. However, bit-widths are tipically set statically for a given model, regardless of input data. Unless models are retrained, this solution invariably sacrifices accuracy for energy efficiency. In this paper, we propose a new approach for implementing input-dependant dynamic bit-width reconfiguration in deep learning accelerators. Our method is based on a fully automatic characterization phase, and can be applied to popular models without retraining. Using the energy data from a real deep learning accelerator chip, we show that 50% energy reduction can be achieved with respect to a static bit-width selection, with less than 1% accuracy loss.
Year
DOI
Venue
2018
10.1145/3218603.3218611
ISLPED
Field
DocType
ISBN
Latency (engineering),Efficient energy use,Computer science,Implementation,Chip,Bandwidth (signal processing),Artificial intelligence,Deep learning,Computer hardware,Control reconfiguration,Cloud computing
Conference
978-1-4503-5704-3
Citations 
PageRank 
References 
2
0.42
16
Authors
3
Name
Order
Citations
PageRank
daniele jahier pagliari12113.19
Enrico Macii22405349.96
Massimo Poncino346057.48