Title
Enhancing Classification Effectiveness of Chinese News Based on Term Frequency
Abstract
For the daily news published on the web, in general, they can be classified into various categories, such as social, politics, entertainment, and so on. These classifications motivate users to watch the desired information. If the classification is wrong, user cannot catch accurately context. How to accurately classify the daily news is becoming an important issue. In this paper, we will propose a method to enhance the effectiveness of news classification. We will utilize the term frequency appeared in variety of classified historical news to training the weighting of each category of each term. And then classify the test news based on the weighting. We propose a framework and an algorithm to training the weighting of each term. The training data, which are over 3500 Chinese news, are collected from UDN and LTN, which are two major electrical news portals in Taiwan. Based on the weighting mechanism, we conduct some experiments to evaluate the effectiveness of the algorithm. The test data are 170 Chinese news, which are collected from Google. The result shows that the traditional manually classification method has up to 13% error classification.
Year
DOI
Venue
2017
10.1109/SC2.2017.26
2017 IEEE 7th International Symposium on Cloud and Service Computing (SC2)
Keywords
Field
DocType
News classification,TF-IDF,Chinese news,Text mining
Training set,Weighting,Information retrieval,Entertainment,Computer science,Test data,Encyclopedia,Statistical classification,The Internet,Electronic publishing
Conference
ISBN
Citations 
PageRank 
978-1-5386-5863-5
0
0.34
References 
Authors
4
2
Name
Order
Citations
PageRank
Tzu-Yi Chan100.34
Yue-Shan Chang229537.68