Title | ||
---|---|---|
CNewsTS - A Large-scale Chinese News Dataset with Hierarchical Topic Category and Summary |
Abstract | ||
---|---|---|
In this paper, we present a large Chinese news article dataset containing 4.4 million articles. The articles are obtained from different news channels and sources. They are labeled with multi-level topic categories, and some of them also have summaries. This is the first Chinese news dataset that has both hierarchical topic labels and full article texts, and it is also the largest Chinese news topic dataset. We describe the data collection, annotation, and quality evaluation processes. We also present basic statistics of the dataset, a comparison with other datasets, and benchmark experiments. |
Year | DOI | Venue |
---|---|---|
2022 | 10.1145/3511808.3557561 | Conference on Information and Knowledge Management |
DocType | Citations | PageRank |
---|---|---|
Conference | 0 | 0.34 |
References | Authors |
---|---|
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Quanzhi Li | 1 | 0 | 0.34 |
Yingchi Liu | 2 | 0 | 0.34 |
Yang Chao | 3 | 0 | 0.34 |