Title
A high-performing comprehensive learning algorithm for text classification without pre-labeled training set
Abstract
In this paper, we investigate a comprehensive learning algorithm for text classification without pre-labeled training set based on incremental learning. In order to overcome the high cost in getting labeled training examples, this approach reforms fuzzy partition clustering to obtain a small quantity of labeled training data. Then the incremental learning of Bayesian classifier is applied. The model of the proposed classifier is composed of a Naïve-Bayes-based incremental learning algorithm and a modified fuzzy partition clustering method. For improved efficiency, a feature reduction is designed based on the Quadratic Entropy in Mutual Information. We perform experiments to demonstrate the performance of the approach, and the results show that our approach is feasible and effective.
Year
DOI
Venue
2011
10.1007/s10115-011-0387-3
Knowl. Inf. Syst.
Keywords
Field
DocType
Text classification,Clustering,Dimension reduction,Fuzzy clustering,Incremental learning
Data mining,Fuzzy clustering,Semi-supervised learning,Computer science,Artificial intelligence,Conceptual clustering,Cluster analysis,Population-based incremental learning,Stability (learning theory),Naive Bayes classifier,Pattern recognition,Correlation clustering,Algorithm,Machine learning
Journal
Volume
Issue
ISSN
29
3
0219-1377
Citations 
PageRank 
References 
9
0.47
15
Authors
2
Name
Order
Citations
PageRank
Lizhen Liu1387.95
Qianhui Liang227520.24