Title
An Efficient Active Constraint Selection Algorithm for Clustering
Abstract
In this paper, we address the problem of active query selection for clustering with constraints. The objective is to determine automatically a set of queries and their associated must-link and can-not link constraints to help constraints based clustering algorithms to converge. Some works on active constraints learning have already been proposed but they are only applied to K-Means like clustering algorithms which are known to be limited to spherical clusters while we are interested in constraints-based clustering algorithms that deals with clusters of arbitrary shapes and sizes (like Constrained-DBSCAN, Constrained-Hierarchical Clustering. . . ). Our novel approach relies on a k-nearest neighbors graph to estimate the dense regions of the data space and generates queries at the frontier between clusters where the cluster membership is most uncertain. Experiments show that our framework improves the performance of constraints based clustering algorithms.
Year
DOI
Venue
2010
10.1109/ICPR.2010.727
ICPR
Keywords
Field
DocType
pattern clustering,constraints-based clustering algorithm,data space,associated must-link,clustering algorithm,active constraint,arbitrary shape,can-not link constraint,pairwise constraint,active learning,cluster membership,k-nearest neighbors graph,constraints based clustering algorithms,k-means like clustering algorithms,constrained-hierarchical clustering,constraint handling,active query selection,graph theory,query selection,active constraint selection algorithm,constrained dbscan,query processing,clustering,efficient active constraint selection,k means,hierarchical clustering,shape,indexes,artificial neural networks,clustering algorithms,glass,skeleton,k nearest neighbor
Data mining,Canopy clustering algorithm,Fuzzy clustering,CURE data clustering algorithm,Data stream clustering,Correlation clustering,Pattern recognition,Computer science,Determining the number of clusters in a data set,Artificial intelligence,Constrained clustering,Cluster analysis
Conference
ISSN
ISBN
Citations 
1051-4651
978-1-4244-7542-1
9
PageRank 
References 
Authors
0.47
10
3
Name
Order
Citations
PageRank
Viet-Vu Vu1524.75
Nicolas Labroche213917.87
Bernadette Bouchon-meunier31033173.38