Title
Multi-level Grid Based Clustering and GPU Parallel Implementations
Abstract
Clustering algorithm for stream data, as one of stream data mining technologies, has extensive applications on network traffic analysis, telecommunication, planetary remote sensing, web site analysis, etc. Clustering algorithm for stream data has a high demand for real-time processing, but current clustering algorithms for stream data, such as Clustream, Dstream, are all based on sequential algorithms, which are unable to meet the realtime requirement. In this paper, we propose a multi-grid based clustering algorithm for stream data. The algorithm partitions the grid space appropriately on the basis of conventional grid-based DBSCAN clustering algorithm, which can effectively limit the searching scope of grid neighbours to accelerate processing performance. Meanwhile, we utilize CUDA to conduct parallel computing in order to further speed up processing. Through the experiments tested on the KDDCUP-99 open testing dataset, it shows that the processing speed of the algorithm proposed by the paper is 10 times faster than that of the conventional grid-based algorithm and moreover the CUDA based algorithm can achieve an speedup of 3 compared with the algorithm executed on CPU.
Year
DOI
Venue
2017
10.1109/ISPAN-FCST-ISCC.2017.75
2017 14th International Symposium on Pervasive Systems, Algorithms and Networks & 2017 11th International Conference on Frontier of Computer Science and Technology & 2017 Third International Symposium of Creative Computing (ISPAN-FCST-ISCC)
Keywords
Field
DocType
Multi-grid,clustering algorithm for stream data,GPU parallelization
Traffic analysis,Algorithm design,CUDA,Computer science,Parallel computing,Implementation,Cluster analysis,Grid,DBSCAN,Speedup
Conference
ISBN
Citations 
PageRank 
978-1-5386-0841-8
0
0.34
References 
Authors
9
4
Name
Order
Citations
PageRank
Quan Qian1134.54
Shuai Zhao200.34
Chao-Jie Xiao300.34
Che Lun Hung44019.53