Title
Approximate FPGA-based LSTMs under Computation Time Constraints.
Abstract
Recurrent Neural Networks, with the prominence of Long Short-Term Memory (LSTM) networks, have demonstrated state-of-the-art accuracy in several emerging Artificial Intelligence tasks. Nevertheless, the highest performing LSTM models are becoming increasingly demanding in terms of computational and memory load. At the same time, emerging latency-sensitive applications including mobile robots and autonomous vehicles often operate under stringent computation time constraints. In this paper, we address the challenge of deploying computationally demanding LSTMs at a constrained time budget by introducing an approximate computing scheme that combines iterative low-rank compression and pruning, along with a novel FPGA-based LSTM architecture. Combined in an end-to-end framework, the approximation method parameters are optimised and the architecture is configured to address the problem of high-performance LSTM execution in time-constrained applications. Quantitative evaluation on a real-life image captioning application indicates that the proposed system required up to 6.5(times ) less time to achieve the same application-level accuracy compared to a baseline method, while achieving an average of 25(times ) higher accuracy under the same computation time constraints.
Year
DOI
Venue
2018
10.1007/978-3-319-78890-6_1
ARC
DocType
Volume
Citations 
Conference
abs/1801.02190
7
PageRank 
References 
Authors
0.47
9
4
Name
Order
Citations
PageRank
Michalis Rizakis170.47
Stylianos I. Venieris210612.98
Alexandros Kouris371.14
Christos Savvas Bouganis440049.04