Title
An efficient algorithm for maximal margin clustering
Abstract
Maximal margin based frameworks have emerged as a powerful tool for supervised learning. The extension of these ideas to the unsupervised case, however, is problematic since the underlying optimization entails a discrete component. In this paper, we first study the computational complexity of maximal hard margin clustering and show that the hard margin clustering problem can be precisely solved in O(n d+2) time where n is the number of the data points and d is the dimensionality of the input data. However, since it is well known that many datasets commonly `express' themselves primarily in far fewer dimensions, our interest is in evaluating if a careful use of dimensionality reduction can lead to practical and effective algorithms. We build upon these observations and propose a new algorithm that gradually increases the number of features used in the separation model in each iteration, and analyze the convergence properties of this scheme. We report on promising numerical experiments based on a `truncated' version of this approach. Our experiments indicate that for a variety of datasets, good solutions equivalent to those from other existing techniques can be obtained in significantly less time.
Year
DOI
Venue
2012
10.1007/s10898-011-9691-4
J. Global Optimization
Keywords
Field
DocType
Maximum margin,Clustering,Unsupervised,SDP
Data point,Canopy clustering algorithm,Mathematical optimization,Dimensionality reduction,Margin (machine learning),Algorithm,Curse of dimensionality,Supervised learning,Cluster analysis,Mathematics,Computational complexity theory
Journal
Volume
Issue
ISSN
52
1
0925-5001
Citations 
PageRank 
References 
7
0.48
20
Authors
5
Name
Order
Citations
PageRank
Ji-Ming Peng150045.74
Lopamudra Mukherjee241521.97
Vikas Singh328120.78
Dale Schuurmans42760317.49
Linli Xu579042.51