Title
SNCStream+: Extending a high quality true anytime data stream clustering algorithm.
Abstract
Abstract Data Stream Clustering is an active area of research which requires efficient algorithms capable of finding and updating clusters incrementally as data arrives. On top of that, due to the inherent evolving nature of data streams, it is expected that algorithms undergo both concept drifts and evolutions, which must be taken into account by the clustering algorithm, allowing incremental clustering updates. In this paper we present the Social Network Clusterer Stream + (SNCStream + ). SNCStream + tackles the data stream clustering problem as a network formation and evolution problem, where instances and micro-clusters form clusters based on homophily. Our proposal has its parameters analyzed and it is evaluated in a broad set of problems against literature baselines. Results show that SNCStream + achieves superior clustering quality (CMM), and feasible processing time and memory space usage when compared to the original SNCStream and other proposals of the literature.
Year
Venue
Field
2016
Inf. Syst.
Data mining,Canopy clustering algorithm,Fuzzy clustering,CURE data clustering algorithm,Data stream clustering,Correlation clustering,Computer science,Algorithm,Determining the number of clusters in a data set,Constrained clustering,Cluster analysis,Database
DocType
Volume
Citations 
Journal
62
4
PageRank 
References 
Authors
0.39
19
4
Name
Order
Citations
PageRank
Jean Paul Barddal114016.77
Heitor Murilo Gomes215517.36
Fabrício Enembreck327438.42
Jean-Paul A. Barthès41090151.60