Abstract | ||
---|---|---|
The problem of deciding whether a given set of data points forms one cluster or two clusters is investigated from a robust hypothesis testing perspective. It is assumed that a clustering algorithm exists that for both cases calculates cluster assignments and estimates of the corresponding probability density functions. Based on the latter, a statistical hypothesis test for the true number of clusters is formulated. In order to take falsely labeled data points into account, the clusters are then modeled as being contaminated with outliers. This leads to an uncertainty model for the cluster densities of the s -contamination type, whose corresponding minimax optimal robust detector is well-known and can be implemented using least favorable densities. The performance of this detector under cluster overlap, cluster imbalance, and for different contamination ratios is evaluated numerically and is compared to that of a Bayesian cluster enumeration criterion. Significant performance improvements are shown in all cases. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/ICASSP.2019.8683180 | 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
Keywords | Field | DocType |
Clustering, Cluster Enumeration, Cluster Analysis, Rubust Detection | Data point,Cluster (physics),Minimax,Pattern recognition,Computer science,Outlier,Algorithm,Artificial intelligence,Cluster analysis,Probability density function,Statistical hypothesis testing,Bayesian probability | Conference |
ISSN | Citations | PageRank |
1520-6149 | 0 | 0.34 |
References | Authors | |
0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Michael Fauss | 1 | 6 | 9.05 |
Michael Muma | 2 | 144 | 19.51 |
freweyni k teklehaymanot | 3 | 8 | 3.51 |
Abdelhak M. Zoubir | 4 | 1036 | 148.03 |