Title
Online Cluster Validity Indices For Performance Monitoring Of Streaming Data Clustering
Abstract
Cluster analysis is used to explore structure in unlabeled batch data sets in a wide range of applications. An important part of cluster analysis is validating the quality of computationally obtained clusters. A large number of different internal indices have been developed for validation in the offline setting. However, this concept cannot be directly extended to the online setting because streaming algorithms do not retain the data, nor maintain a partition of it, both needed by batch cluster validity indices. In this paper, we develop two incremental versions (with and without forgetting factors) of the Xie-Beni and Davies-Bouldin validity indices, and use them to monitor and control two streaming clustering algorithms (sk-means and online ellipsoidal clustering), In this context, our new incremental validity indices are more accurately viewed as performance monitoring functions. We also show that incremental cluster validity indices can send a distress signal to online monitors when evolving structure leads an algorithm astray. Our numerical examples indicate that the incremental Xie-Beni index with a forgetting factor is superior to the other three indices tested.
Year
DOI
Venue
2019
10.1002/int.22064
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
DocType
Volume
Issue
Journal
34
4
ISSN
Citations 
PageRank 
0884-8173
2
0.38
References 
Authors
0
5
Name
Order
Citations
PageRank
Masud Moshtaghi119515.96
James C. Bezdek23521625.56
Sarah M. Erfani323623.58
Christopher Leckie42422155.20
James Bailey52172164.56