Title
Hybrid Decision Based Chinese News Headline Classification.
Abstract
In recent years, short text classification has attracted increasing attention. With the development of social platforms such as microblogs and WeChat, Chinese short text classification has had a great impact on public opinion analysis and sentiment mining. Among social media texts, news headline classification has substantial influence on both academia and the Internet economy. Issues such as the semantic sparsity caused by the limited length of the texts, and their nonstandard grammar, have limited classification performance. In this paper, a Chinese news headline classification method based on multi-model decision is proposed. First, a Convolutional Neural Network (CNN) is applied as one text classifier, and a Long Short-Term Memory (LSTM) network is used as another. The aim is to obtain both the abstract semantics of news headlines through the CNN and the context information between word sequences through the LSTM. Second, an efficient text categorization tool, fastText (Facebook), is introduced to obtain strong and balanced results. Finally, a decision model is proposed to achieve the best classification performance: a simple but very effective voting system, whose results are very promising. Experiments on the dataset from NLPCC 2017 Task 2 demonstrate the effectiveness of our method, which achieves much higher performance (\(F_{1}\) of 79%) than the baseline provided by NLPCC 2017.
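The abstract does not spell out the voting rule, so the following is only a minimal sketch of how a vote over three classifiers might look, assuming a plain majority vote with a priority-based tie-break. The function name `majority_vote`, the classifier keys (`cnn`, `lstm`, `fasttext`), the labels, and the tie-break order are all hypothetical and are not taken from the paper.

```python
from collections import Counter


def majority_vote(predictions, priority):
    """Return the label most classifiers agree on.

    predictions: dict mapping a classifier name to its predicted label.
    priority: classifier names, most trusted first, used only to break ties
              (an assumed tie-break rule, not one described in the abstract).
    """
    counts = Counter(predictions.values())
    top = max(counts.values())
    winners = {label for label, n in counts.items() if n == top}
    if len(winners) == 1:
        return next(iter(winners))
    # Tie: use the label of the highest-priority classifier among the tied labels.
    for name in priority:
        if predictions.get(name) in winners:
            return predictions[name]
    raise ValueError("priority must name at least one classifier in predictions")


# Two of three hypothetical classifiers agree, so their label wins.
agreed = majority_vote(
    {"cnn": "sports", "lstm": "sports", "fasttext": "finance"},
    priority=["fasttext", "cnn", "lstm"],
)

# All three disagree, so the assumed priority order decides.
tied = majority_vote(
    {"cnn": "sports", "lstm": "tech", "fasttext": "finance"},
    priority=["fasttext", "cnn", "lstm"],
)
```

Placing fastText first in the tie-break order is purely illustrative; a weighted vote based on validation scores would be an equally plausible reading of "decision model".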
Year
2018
Venue
APWeb/WAIM Workshops
Field
Headline, Social media, Voting, Convolutional neural network, Computer science, Microblogging, Natural language processing, Decision model, Artificial intelligence, Classifier (linguistics), Machine learning, Semantics
DocType
Conference
Citations
0
PageRank
0.34
References
7
Authors
5
Name         Order  Citations  PageRank
Yukun Cao    1      0          1.01
Xiaofei Xu   2      408        70.26
Ye Du        3      0          0.34
Jun He       4      12         1.96
Li Li        5      17         7.07