Title
CNewsTS - A Large-scale Chinese News Dataset with Hierarchical Topic Category and Summary
Abstract
ABSTRACTIn this paper, we present a large Chinese news article dataset with 4.4 million articles. These articles are obtained from different news channels and sources. They are labeled with multi-level topic categories, and some of them also have summaries. This is the first Chinese news dataset that has both hierarchical topic labels and article full texts. And it is also the largest Chinese news topic dataset. We describe the data collection, annotation and quality evaluation process. The basic statistics of the dataset, comparison with other datasets and benchmark experiments are also presented.
Year
DOI
Venue
2022
10.1145/3511808.3557561
Conference on Information and Knowledge Management
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Quanzhi Li100.34
Yingchi Liu200.34
Yang Chao300.34