Title
A Statistical Performance Analysis of Graph Clustering Algorithms.
Abstract
Measuring graph clustering quality remains an open problem. Here, we introduce three statistical measures to address the problem. We empirically explore their behavior under a number of stress test scenarios and compare it to the commonly used modularity and conductance. Our measures are robust, immune to resolution limit, easy to intuitively interpret and also have a formal statistical interpretation. Our empirical stress test results confirm that our measures compare favorably to the established ones. In particular, they are shown to be more responsive to graph structure, less sensitive to sample size and breakdowns during numerical implementation and less sensitive to uncertainty in connectivity. These features are especially important in the context of larger data sets or when the data may contain errors in the connectivity patterns.
Year
DOI
Venue
2018
10.1007/978-3-319-92871-5_11
ALGORITHMS AND MODELS FOR THE WEB GRAPH (WAW 2018)
Field
DocType
Volume
Stress test,Discrete mathematics,Graph,Data set,Open problem,Computer science,Theoretical computer science,Conductance,Clustering coefficient,Modularity,Sample size determination
Conference
10836
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
11
4
Name
Order
Citations
PageRank
Pierre Miasnikof100.34
Alexander Y. Shestopaloff201.35
Anthony J. Bonner3733422.63
yuri lawryshyn452.11