Title | ||
---|---|---|
Online Cluster Validity Indices For Performance Monitoring Of Streaming Data Clustering |
Abstract | ||
---|---|---|
Cluster analysis is used to explore structure in unlabeled batch data sets in a wide range of applications. An important part of cluster analysis is validating the quality of computationally obtained clusters. A large number of different internal indices have been developed for validation in the offline setting. However, this concept cannot be directly extended to the online setting because streaming algorithms do not retain the data, nor maintain a partition of it, both needed by batch cluster validity indices. In this paper, we develop two incremental versions (with and without forgetting factors) of the Xie-Beni and Davies-Bouldin validity indices, and use them to monitor and control two streaming clustering algorithms (sk-means and online ellipsoidal clustering), In this context, our new incremental validity indices are more accurately viewed as performance monitoring functions. We also show that incremental cluster validity indices can send a distress signal to online monitors when evolving structure leads an algorithm astray. Our numerical examples indicate that the incremental Xie-Beni index with a forgetting factor is superior to the other three indices tested. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1002/int.22064 | INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS |
DocType | Volume | Issue |
Journal | 34 | 4 |
ISSN | Citations | PageRank |
0884-8173 | 2 | 0.38 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Masud Moshtaghi | 1 | 195 | 15.96 |
James C. Bezdek | 2 | 3521 | 625.56 |
Sarah M. Erfani | 3 | 236 | 23.58 |
Christopher Leckie | 4 | 2422 | 155.20 |
James Bailey | 5 | 2172 | 164.56 |