Title
A Semantic Representation Enhancement Method for Chinese News Headline Classification.
Abstract
Recently there has been an increasing research interest in short text such as news headline. Due to the inherent sparsity of short text, the current text classification methods perform badly when applied to the classification of news headlines. To overcome this problem, a novel method which enhances the semantic representation of headlines is proposed in this paper. Firstly, we add some keywords extracted from the most similar news to expand the word features. Secondly, we use the corpus in news domain to pre-train the word embedding so as to enhance the word representation. Moreover, Fasttext classifier, which uses a liner method to classify text with fast speed and high accuracy, is adopted for news headline classification. On the task for Chinese news headline categorization in NLPCC2017, the proposed method achieved 83.1% of the F-measure, which got the first rank in 33 teams.
Year
DOI
Venue
2017
10.1007/978-3-319-73618-1_27
Lecture Notes in Artificial Intelligence
Keywords
DocType
Volume
Semantic representation enhancement,Short text classification,News headline,Word embedding
Conference
10619
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Zhongbo Yin100.34
Jintao Tang28914.00
Chengsen Ru341.40
Wei Luo412027.50
Zhunchen Luo513014.71
Xiaolei Ma620315.59