Title
BaGuaLu: targeting brain scale pretrained models with over 37 million cores
Abstract
Large-scale pretrained AI models have demonstrated state-of-the-art accuracy in a range of important applications. As the size of pretrained AI models grows dramatically each year in pursuit of higher accuracy, training such models requires massive computing and memory capabilities, which accelerates the convergence of AI and HPC. However, gaps remain in deploying AI applications on HPC systems, which call for application and system co-design based on specific hardware features. To this end, this paper proposes BaGuaLu, the first work targeting the training of brain scale models on an entire exascale supercomputer, the New Generation Sunway Supercomputer. By combining hardware-specific intra-node optimizations with hybrid parallel strategies, BaGuaLu achieves good performance and scalability on unprecedentedly large models. The evaluation shows that BaGuaLu can train 14.5-trillion-parameter models at over 1 EFLOPS using mixed precision and has the capability to train 174-trillion-parameter models, a count that rivals the number of synapses in a human brain.
Year
2022
DOI
10.1145/3503221.3508417
Venue
Principles and Practice of Parallel Programming
DocType
Conference
Citations
0
PageRank
0.34
References
0
Authors
25
Name | Order | Citations | PageRank
Zixuan Ma | 1 | 0 | 1.01
Jiaao He | 2 | 2 | 1.46
Jiezhong Qiu | 3 | 268 | 12.48
Huanqi Cao | 4 | 0 | 1.35
Yuanwei Wang | 5 | 0 | 0.34
Zhenbo Sun | 6 | 0 | 1.01
Liyan Zheng | 7 | 0 | 1.35
Haojie Wang | 8 | 2 | 3.75
Shizhi Tang | 9 | 0 | 1.35
Tianyu Zheng | 10 | 0 | 0.34
Junyang Lin | 11 | 0 | 0.34
Guanyu Feng | 12 | 0 | 1.35
Zeqiang Huang | 13 | 0 | 0.34
Jie Gao | 14 | 0 | 0.34
Aohan Zeng | 15 | 0 | 0.34
Jianwei Zhang | 16 | 0 | 0.34
Runxin Zhong | 17 | 0 | 0.68
Tianhui Shi | 18 | 0 | 0.68
Sha Liu | 19 | 0 | 0.34
Weimin Zheng | 20 | 1889 | 182.48
Jie Tang | 21 | 5871 | 300.22
Hongxia Yang | 22 | 271 | 35.55
Xin Liu | 23 | 0 | 0.34
Jidong Zhai | 24 | 340 | 36.27
Wenguang Chen | 25 | 1014 | 70.57