Title
DLUX: a LUT-based Near-Bank Accelerator for Data Center Deep Learning Training Workloads
Abstract
The frequent data movement between the processor and the memory has become a severe performance bottleneck for deep neural network (DNN) training workloads in data centers. To solve this off-chip memory access challenge, the 3-D stacking processing-in-memory (3D-PIM) architecture provides a viable solution. However, existing 3D-PIM designs for DNN training suffer from the limited memory bandwidth ...
Year
DOI
Venue
2021
10.1109/TCAD.2020.3021336
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Keywords
DocType
Volume
Training,Table lookup,Random access memory,Bandwidth,Layout,Three-dimensional displays
Journal
40
Issue
ISSN
Citations 
8
0278-0070
0
PageRank 
References 
Authors
0.34
0
7
Name
Order
Citations
PageRank
Peng Gu114211.30
Xinfeng Xie2526.39
Shuangchen Li363636.82
Dimin Niu460931.36
Hongzhong Zheng51225.94
Krishna T. Malladi624918.37
Yuan Xie76430407.00