Title | ||
---|---|---|
A high-performing comprehensive learning algorithm for text classification without pre-labeled training set |
Abstract | ||
---|---|---|
In this paper, we investigate a comprehensive learning algorithm for text classification without pre-labeled training set based on incremental learning. In order to overcome the high cost in getting labeled training examples, this approach reforms fuzzy partition clustering to obtain a small quantity of labeled training data. Then the incremental learning of Bayesian classifier is applied. The model of the proposed classifier is composed of a Naïve-Bayes-based incremental learning algorithm and a modified fuzzy partition clustering method. For improved efficiency, a feature reduction is designed based on the Quadratic Entropy in Mutual Information. We perform experiments to demonstrate the performance of the approach, and the results show that our approach is feasible and effective. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1007/s10115-011-0387-3 | Knowl. Inf. Syst. |
Keywords | Field | DocType |
Text classification,Clustering,Dimension reduction,Fuzzy clustering,Incremental learning | Data mining,Fuzzy clustering,Semi-supervised learning,Computer science,Artificial intelligence,Conceptual clustering,Cluster analysis,Population-based incremental learning,Stability (learning theory),Naive Bayes classifier,Pattern recognition,Correlation clustering,Algorithm,Machine learning | Journal |
Volume | Issue | ISSN |
29 | 3 | 0219-1377 |
Citations | PageRank | References |
9 | 0.47 | 15 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Lizhen Liu | 1 | 38 | 7.95 |
Qianhui Liang | 2 | 275 | 20.24 |