Title
Parallel glowworm swarm optimization clustering algorithm based on MapReduce
Abstract
Clustering large data is one of the recently challenging tasks that is used in many application areas such as social networking, bioinformatics and many others. Traditional clustering algorithms need to be modified to handle the increasing data sizes. In this paper, a scalable design and implementation of glowworm swarm optimization clustering (MRCGSO) using MapReduce is introduced to handle big data. The proposed algorithm uses glowworm swarm optimization to formulate the clustering algorithm. Glowworm swarm optimization is used to take advantage of its ability in solving multimodal problems, which in terms of clustering means finding multiple centroids. MRCGSO uses the MapReduce methodology for the parallelization since it provides fault tolerance, load balancing and data locality. The experimental results reveal that MRCGSO scales very well with increasing data set sizes and achieves a very close to linear speedup while maintaining the clustering quality.
Year
DOI
Venue
2014
10.1109/SIS.2014.7011794
Swarm Intelligence
Keywords
Field
DocType
data handling,parallel processing,particle swarm optimisation,pattern clustering,MRCGSO,MapReduce,large data clustering,parallel glowworm swarm optimization clustering algorithm,Big data clustering,Hadoop,Parallel Processing
Canopy clustering algorithm,Data mining,CURE data clustering algorithm,Clustering high-dimensional data,Data stream clustering,Correlation clustering,Computer science,Glowworm swarm optimization,Multi-swarm optimization,Artificial intelligence,Cluster analysis,Machine learning
Conference
Citations 
PageRank 
References 
4
0.38
16
Authors
3
Name
Order
Citations
PageRank
Nailah AL-Madi1363.76
Ibrahim Aljarah270333.62
Simone A Ludwig31309179.41