Title
Configurable Deep Learning Accelerator with Bitwise-accurate Training and Verification
Abstract
This paper introduces an end-to-end solution to a deep neural network (DNN) inference system. We customize and enrich a family of deep-learning accelerators (DLA) based on the NVIDIA open-source deep-learning accelerator (NVDLA). Our exclusive enhancement includes hardware and software parts. The hardware part is the shared multiplier array of both high-efficient regular and depth-wise convolution...
Year
DOI
Venue
2022
10.1109/VLSI-DAT54769.2022.9768062
2022 International Symposium on VLSI Design, Automation and Test (VLSI-DAT)
Keywords
DocType
ISBN
Deep learning,Training,Power demand,Neural networks,Prototypes,Object detection,Very large scale integration
Conference
978-1-6654-0921-6
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Shien-Chun Luo100.34
Kuo-Chiang Chang200.34
Po-Wei Chen300.34
Zhao-Hong Chen400.34