A high-performing comprehensive learning algorithm for text classification without pre-labeled training set - Citegraph

Paper Info

Title
A high-performing comprehensive learning algorithm for text classification without pre-labeled training set

Abstract
In this paper, we investigate a comprehensive learning algorithm for text classification without pre-labeled training set based on incremental learning. In order to overcome the high cost in getting labeled training examples, this approach reforms fuzzy partition clustering to obtain a small quantity of labeled training data. Then the incremental learning of Bayesian classifier is applied. The model of the proposed classifier is composed of a Naïve-Bayes-based incremental learning algorithm and a modified fuzzy partition clustering method. For improved efficiency, a feature reduction is designed based on the Quadratic Entropy in Mutual Information. We perform experiments to demonstrate the performance of the approach, and the results show that our approach is feasible and effective.

Year	DOI	Venue
2011	10.1007/s10115-011-0387-3	Knowl. Inf. Syst.
Keywords	Field	DocType
Text classification,Clustering,Dimension reduction,Fuzzy clustering,Incremental learning	Data mining,Fuzzy clustering,Semi-supervised learning,Computer science,Artificial intelligence,Conceptual clustering,Cluster analysis,Population-based incremental learning,Stability (learning theory),Naive Bayes classifier,Pattern recognition,Correlation clustering,Algorithm,Machine learning	Journal
Volume	Issue	ISSN
29	3	0219-1377
Citations	PageRank	References
9	0.47	15
Authors
2

Authors (2 rows)

Cited by (9 rows)

References (15 rows)

Name	Order	Citations	PageRank
Lizhen Liu	1	38	7.95
Qianhui Liang	2	275	20.24

1