RaPiD: AI Accelerator for Ultra-low Precision Training and Inference

Paper Info

Title
RaPiD: AI Accelerator for Ultra-low Precision Training and Inference

Abstract
The growing prevalence and computational demands of Artificial Intelligence (AI) workloads has led to widespread use of hardware accelerators in their execution. Scaling the performance of AI accelerators across generations is pivotal to their success in commercial deployments. The intrinsic error-resilient nature of AI workloads present a unique opportunity for performance/energy improvement through precision scaling. Motivated by the recent algorithmic advances in precision scaling for inference and training, we designed RaPiD <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> , a 4-core AI accelerator chip supporting a spectrum of precisions, namely, 16 and 8-bit floating-point and 4 and 2-bit fixed-point. The 36mm <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> RaPiD chip fabricated in 7nm EUV technology delivers a peak 3.5 TFLOPS/W in HFP8 mode and 16.5 TOPS/W in INT4 mode at nominal voltage. Using a performance model calibrated to within 1% of the measurement results, we evaluated DNN inference using 4-bit fixed-point representation for a 4-core 1 RaPiD chip system and DNN training using 8-bit floating point representation for a 768 TFLOPs AI system comprising 4 32-core RaPiD chips. Our results show INT4 inference for batch size of 1 achieves 3 - 13.5 (average 7) TOPS/W and FP8 training for a mini-batch of 512 achieves a sustained 102 - 588 (average 203) TFLOPS across a wide range of applications.

Year	DOI	Venue
2021	10.1109/ISCA52012.2021.00021	2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA)
Keywords	DocType	ISSN
Hardware Acceleration,Deep Neural Networks,Reduced Precision	Conference	1063-6897
ISBN	Citations	PageRank
978-1-6654-3334-1	3	0.39
References	Authors
0	54

Authors (54 rows)

Cited by (3 rows)

References (0 rows)

Name	Order	Citations	PageRank
Swagath Venkataramani	1	631	39.33
Vijayalakshmi Srinivasan	2	1077	83.50
Wei Wang	3	3	0.39
Sanchari Sen	4	11	3.95
Jintao Zhang	5	3	0.39
Ankur Agrawal	6	314	22.16
Monodeep Kar	7	7	1.17
Shubham Jain	8	14	6.84
Alberto Mannari	9	3	0.39
Hoang Tran	10	3	0.39
Yulong Li	11	7	1.17
Eri Ogawa	12	3	0.39
Kazuaki Ishizaki	13	191	17.66
Hiroshi Inoue	14	77	5.88
Marcel Schaal	15	11	1.92
Mauricio J. Serrano	16	511	55.17
Jungwook Choi	17	122	18.55
Xiao Sun	18	3	0.39
Naigang Wang	19	50	7.37
Chia-Yu Chen	20	28	4.64
Allison Allain	21	3	0.39
James Bonanno	22	9	1.53
Nianzheng Cao	23	12	2.63
Robert Casatuta	24	9	1.53
Matthew Cohen	25	3	0.39
Bruce M. Fleischer	26	3	0.39
Michael Guillorn	27	12	2.63
Howard Haynie	28	12	2.63
Jinwook Jung	29	25	9.47
Mingu Kang	30	70	7.88
Kyu-Hyoun Kim	31	7	1.17
Siyu Koswatta	32	9	1.53
Sae Kyu Lee	33	8	1.13
Martin Lutz	34	9	1.53
Silvia Mueller	35	11	2.23
Jinwook Oh	36	157	20.04
Ashish Ranjan	37	3	0.39
Zhibin Ren	38	7	1.17
Scot Rider	39	9	1.53
Kerstin Schelm	40	7	1.17
Michael Scheuermann	41	11	2.23
Joel Silberman	42	12	2.63
Jie Yang	43	3	0.39
Vidhi Zalani	44	7	1.17
Xin Zhang	45	17	1.88
Ching Zhou	46	18	3.92
MATTHEW M. ZIEGLER	47	219	50.73
Vinay Shah	48	9	1.53
Moriyoshi Ohara	49	3	0.72
Pong-Fei Lu	50	12	2.97

1
2
50 / page