Abstract | ||
---|---|---|
Abstract Data Stream Clustering is an active area of research which requires efficient algorithms capable of finding and updating clusters incrementally as data arrives. On top of that, due to the inherent evolving nature of data streams, it is expected that algorithms undergo both concept drifts and evolutions, which must be taken into account by the clustering algorithm, allowing incremental clustering updates. In this paper we present the Social Network Clusterer Stream + (SNCStream + ). SNCStream + tackles the data stream clustering problem as a network formation and evolution problem, where instances and micro-clusters form clusters based on homophily. Our proposal has its parameters analyzed and it is evaluated in a broad set of problems against literature baselines. Results show that SNCStream + achieves superior clustering quality (CMM), and feasible processing time and memory space usage when compared to the original SNCStream and other proposals of the literature. |
Year | Venue | Field |
---|---|---|
2016 | Inf. Syst. | Data mining,Canopy clustering algorithm,Fuzzy clustering,CURE data clustering algorithm,Data stream clustering,Correlation clustering,Computer science,Algorithm,Determining the number of clusters in a data set,Constrained clustering,Cluster analysis,Database |
DocType | Volume | Citations |
Journal | 62 | 4 |
PageRank | References | Authors |
0.39 | 19 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jean Paul Barddal | 1 | 140 | 16.77 |
Heitor Murilo Gomes | 2 | 155 | 17.36 |
Fabrício Enembreck | 3 | 274 | 38.42 |
Jean-Paul A. Barthès | 4 | 1090 | 151.60 |