Title
Building text classifiers using positive, unlabeled and 'outdated' examples.
Abstract
Learning from positive and unlabeled examples PU learning is a partially supervised classification that is frequently used in Web and text retrieval system. The merit of PU learning is that it can get good performance with less manual work. Motivated by transfer learning, this paper presents a novel method that transfers the 'outdated data' into the process of PU learning. We first propose a way to measure the strength of the features and select the strong features and the weak features according to the strength of the features. Then, we extract the reliable negative examples and the candidate negative examples using the strong and the weak features Transfer-1DNF. Finally, we construct a classifier called weighted voting iterative support vector machine SVM that is made up of several subclassifiers by applying SVM iteratively, and each subclassifier is assigned a weight in each iteration. We conduct the experiments on two datasets: 20 Newsgroups and Reuters-21578, and compare our method with three baseline algorithms: positive example-based learning, weighted voting classifier and SVM. The results show that our proposed method Transfer-1DNF can extract more reliable negative examples with lower error rates, and our classifier outperforms the baseline algorithms. Copyright © 2016 John Wiley & Sons, Ltd.
Year
DOI
Venue
2016
10.1002/cpe.3879
Concurrency and Computation: Practice and Experience
Keywords
Field
DocType
text classification,strong features,weak features,Transfer-1DNF,WV-ISVM
PU learning,Pattern recognition,Computer science,Transfer of learning,Support vector machine,Weighted voting,Artificial intelligence,Classifier (linguistics),Text retrieval,Machine learning
Journal
Volume
Issue
ISSN
28
13
1532-0626
Citations 
PageRank 
References 
2
0.36
17
Authors
5
Name
Order
Citations
PageRank
Jiayu Han1377.43
Wanli Zuo234242.73
Lu Liu3284.39
Yuanbo Xu4172.22
Tao Peng59812.70