Title
Streamline Density Peak Clustering for Practical Adoptions
Abstract
Since Density Peak Clustering (DPC) algorithm was proposed in 2014, it has drawn lots of interest in various domains. As a clustering method, DPC features superior generality, robustness, flexibility and simplicity. There are however two main roadblocks for its practical adoptions, both centered around the selection of cutoff distance, the single critical hyperparameter of DPC. This work proposes an improved algorithm named Streamlined Density Peak Clustering (SDPC). SDPC speeds up DPC executions on a sequence of cutoff distances by 2.2-8.8X while at the same time reducing memory usage by a magnitude. As an algorithm preserving the original semantic of DPC, SDPC offers an efficient and scalable drop-in replacement of DPC for data clustering.
Year
DOI
Venue
2019
10.1145/3357384.3358053
Proceedings of the 28th ACM International Conference on Information and Knowledge Management
Keywords
Field
DocType
algorithm optimization, density clustering, hyperparameter tuning
Data mining,Information retrieval,Computer science,Cluster analysis
Conference
ISBN
Citations 
PageRank 
978-1-4503-6976-3
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Shuai Yang101.35
Xipeng Shen22025118.55
Min Chi318728.26