Title
CPM: A Large-scale Generative Chinese Pre-trained Language Model
Abstract
Pre-trained Language Models (PLMs) have proven to be beneficial for various downstream NLP tasks. Recently, GPT-3, with 175 billion parameters and 570 GB of training data, drew a lot of attention due to its capacity for few-shot (even zero-shot) learning. However, applying GPT-3 to address Chinese NLP tasks is still challenging, as the training corpus of GPT-3 is primarily English and the parameters are not publicly available. In this technical report, we release the Chinese Pre-trained Language Model (CPM) with generative pre-training on large-scale Chinese training data. To the best of our knowledge, CPM, with 2.6 billion parameters and 100 GB of Chinese training data, is the largest Chinese pre-trained language model, which could facilitate several downstream Chinese NLP tasks, such as conversation, essay generation, cloze test, and language understanding. Extensive experiments demonstrate that CPM achieves strong performance on many NLP tasks in the settings of few-shot (even zero-shot) learning. The code and parameters are available at https://github.com/TsinghuaAI/CPM.
Year
2021
DOI
10.1016/j.aiopen.2021.07.001
Venue
AI Open
Keywords
Pre-trained language model, Zero-shot learning
DocType
Journal
Volume
2
ISSN
2666-6510
Citations
0
PageRank
0.34
References
0
Authors
25
Name            Order  Citations  PageRank
Zhengyan Zhang  1      9          1.10
Xu Han          2      15         4.94
Hao Zhou        3      0          0.68
Pei Ke          4      4          1.42
Yuxian Gu       5      0          0.34
Deming Ye       6      3          2.06
Yujia Qin       7      0          0.68
YuSheng Su      8      0          0.68
Haozhe Ji       9      0          2.03
Jian Guan       10     4          1.08
Fanchao Qi      11     12         7.27
Xiaozhi Wang    12     5          4.17
Yanan Zheng     13     0          0.34
Guoyang Zeng    14     1          1.71
Huanqi Cao      15     0          0.68
Shengqi Chen    16     1          2.04
Daixuan Li      17     0          0.34
Zhenbo Sun      18     0          0.68
Zhiyuan Liu     19     2037       123.68
Minlie Huang    20     1260       90.68
Wentao Han      21     103        8.22
Jie Tang        22     5871       300.22
Juanzi Li       23     2526       154.08
Xiaoyan Zhu     24     2125       141.16
Maosong Sun     25     2293       162.86