Title
Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study
Abstract
Semi-supervised classification methods are suitable tools to tackle training sets with large amounts of unlabeled data and a small quantity of labeled data. This problem has been addressed by several approaches with different assumptions about the characteristics of the input data. Among them, self-labeled techniques follow an iterative procedure, aiming to obtain an enlarged labeled data set, in which they accept that their own predictions tend to be correct. In this paper, we provide a survey of self-labeled methods for semi-supervised classification. From a theoretical point of view, we propose a taxonomy based on the main characteristics presented in them. Empirically, we conduct an exhaustive study that involves a large number of data sets, with different ratios of labeled data, aiming to measure their performance in terms of transductive and inductive classification capabilities. The results are contrasted with nonparametric statistical tests. Note is then taken of which self-labeled models are the best-performing ones. Moreover, a semi-supervised learning module has been developed for the Knowledge Extraction based on Evolutionary Learning software, integrating analyzed methods and data sets.
Year
DOI
Venue
2015
10.1007/s10115-013-0706-y
Knowl. Inf. Syst.
Keywords
Field
DocType
self-training,semi-supervised learning,learning from unlabeled data,multi-view learning,classification,co-training
Transduction (machine learning),Data mining,Data set,Semi-supervised learning,Computer science,Co-training,Nonparametric statistics,Software,Unsupervised learning,Knowledge extraction,Artificial intelligence,Machine learning
Journal
Volume
Issue
ISSN
42
2
0219-3116
Citations 
PageRank 
References 
74
1.81
67
Authors
3
Name
Order
Citations
PageRank
Isaac Triguero163331.76
Salvador García234311.50
Francisco Herrera3273911168.49