Approximating Dunn's Cluster Validity Indices for Partitions of Big Data. - Citegraph

Paper Info

Title
Approximating Dunn's Cluster Validity Indices for Partitions of Big Data.

Abstract
Dunn's internal cluster validity index is used to assess partition quality and subsequently identify a ``best'' crisp partition of n objects. Computing Dunn's index (DI) for partitions of n p-dimensional feature vector data has quadratic time complexity O(pn²), so its computation is impractical for very large values of n. This note presents six methods for approximating DI. Four methods are based on Maximin sampling, which identifies a skeleton of the full partition that contains some boundary points in each cluster. Two additional methods are presented that estimate boundary points associated with unsupervised training of one class support vector machines. Numerical examples compare approximations to DI based on all six methods. Four experiments on seven real and synthetic data sets support our assertion that computing approximations to DI with an incremental, neighborhood-based Maximin skeleton is both tractable and reliably accurate.

Year	DOI	Venue
2019	10.1109/TCYB.2018.2806886	IEEE transactions on cybernetics
Keywords	Field	DocType
Indexes,Big Data,Clustering algorithms,Measurement,Skeleton,Cybernetics,Support vector machines	Minimax,Mathematical optimization,Feature vector,Combinatorics,Support vector machine,Sampling (statistics),Cluster analysis,Partition (number theory),Time complexity,Mathematics,Computation	Journal
Volume	Issue	ISSN
49	5	2168-2275
Citations	PageRank	References
0	0.34	0
Authors
5

Authors (5 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Punit Rathore	1	26	4.53
Zahra Ghafoori	2	11	2.64
James C. Bezdek	3	3521	625.56
M. Palaniswami	4	4107	290.84
Christopher Leckie	5	2422	155.20

1