Accelerating CNN computation: quantisation tuning and network resizing - Citegraph

Paper Info

Title
Accelerating CNN computation: quantisation tuning and network resizing

Abstract
The interest in developing cognitive aware systems, specially for vision applications based on artificial neural networks, has grown exponentially in the last years. While high performance systems are key for the success of current Convolutional Neural Network (CNN) implementations, there is a trend to bring these capabilities to embedded real-time systems. This work contributes to tackle this challenge by exploring CNNs design space. Namely, it combines parameter quantisation techniques with a proposed set of CNN architectural transformations to reduce resource and execution time costs on Field Programmable Gate Array (FPGA) devices while maintaining high classification accuracy. An hardware mapping methodology is also proposed for deploying resource constrained CNNs into a reconfigurable platform for efficient algorithm acceleration. The proposed transformations reduce accuracy loss due to quantization by 44% in average. Also, analysis of the performance results obtained in a Central Processing Unit (CPU)+FPGA platform show up to 50% execution time reduction when compared with a state-of-the-art implementation.

Year	DOI	Venue
2018	10.1145/3295816.3295820	Proceedings of the 2nd Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems
Keywords	DocType	ISBN
FPGA, OpenCL, convolutional neural network, driver assistance, object recognition	Conference	978-1-4503-6591-8
Citations	PageRank	References
0	0.34	0
Authors
4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Alexandre Vieira	1	0	0.34
Frederico Pratas	2	119	15.69
Leonel Sousa	3	1210	145.50
Aleksandar Ilic	4	283	35.40

1