Title | ||
---|---|---|
DLUX: a LUT-based Near-Bank Accelerator for Data Center Deep Learning Training Workloads |
Abstract | ||
---|---|---|
The frequent data movement between the processor and the memory has become a severe performance bottleneck for deep neural network (DNN) training workloads in data centers. To solve this off-chip memory access challenge, the 3-D stacking processing-in-memory (3D-PIM) architecture provides a viable solution. However, existing 3D-PIM designs for DNN training suffer from the limited memory bandwidth ... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/TCAD.2020.3021336 | IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems |
Keywords | DocType | Volume |
Training,Table lookup,Random access memory,Bandwidth,Layout,Three-dimensional displays | Journal | 40 |
Issue | ISSN | Citations |
8 | 0278-0070 | 0 |
PageRank | References | Authors |
0.34 | 0 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Peng Gu | 1 | 142 | 11.30 |
Xinfeng Xie | 2 | 52 | 6.39 |
Shuangchen Li | 3 | 636 | 36.82 |
Dimin Niu | 4 | 609 | 31.36 |
Hongzhong Zheng | 5 | 122 | 5.94 |
Krishna T. Malladi | 6 | 249 | 18.37 |
Yuan Xie | 7 | 6430 | 407.00 |