Title
Mind the eigen-gap, or how to accelerate semi-supervised spectral learning algorithms
Abstract
Semi-supervised learning algorithms commonly incorporate the available background knowledge such that an expression of the derived model's quality is improved. Depending on the specific context quality can take several forms and can be related to the generalization performance or to a simple clustering coherence measure. Recently, a novel perspective of semi-supervised learning has been put forward, that associates semi-supervised clustering with the efficiency of spectral methods. More precisely, it has been demonstrated that the appropriate use of partial supervision can bias the data Laplacian matrix such that the necessary eigenvector computations are provably accelerated. This result allows data mining practitioners to use background knowledge not only for improving the quality of clustering results, but also for accelerating the required computations. In this paper we initially provide a high level overview of the relevant efficiency maximizing semi-supervised methods such that their theoretical intuitions are comprehensively outlined. Consecutively, we demonstrate how these methods can be extended to handle multiple clusters and also discuss possible issues that may arise in the continuous semi-supervised solution. Finally, we illustrate the proposed extensions empirically in the context of text clustering.
Year
DOI
Venue
2011
10.5591/978-1-57735-516-8/IJCAI11-448
IJCAI
Keywords
Field
DocType
semi-supervised learning,appropriate use,specific context quality,simple clustering coherence measure,semi-supervised method,available background knowledge,text clustering,clustering result,continuous semi-supervised solution,semi-supervised spectral
Data mining,Document clustering,Computer science,Artificial intelligence,Cluster analysis,Eigenvalues and eigenvectors,Computation,Laplacian matrix,Algorithm,Coherence (physics),Constrained clustering,High- and low-level,Machine learning
Conference
Citations 
PageRank 
References 
3
0.46
7
Authors
1
Name
Order
Citations
PageRank
Dimitrios Mavroeidis11309.50