Title
High-throughput fuzzy clustering on heterogeneous architectures
Abstract
The Internet of Things (IoT) is pushing the next economic revolution in which the main players are data and immediacy. IoT is increasingly producing large amounts of data that are now classified as “dark data” because most are created but never analyzed. The efficient analysis of this data deluge is becoming mandatory in order to transform it into meaningful information. Among the techniques available for this purpose, clustering techniques, which classify different patterns into groups, have proven to be very useful for obtaining knowledge from the data. However, clustering algorithms are computationally hard, especially when it comes to large data sets and, therefore, they require the most powerful computing platforms on the market. In this paper, we investigate coarse and fine grain parallelization strategies in Intel and Nvidia architectures of fuzzy minimals (FM) algorithm; a fuzzy clustering technique that has shown very good results in the literature. We provide an in-depth performance analysis of the FM’s main bottlenecks, reporting a speed-up factor of up to 40× compared to the sequential counterpart version.
Year
DOI
Venue
2020
10.1016/j.future.2020.01.022
Future Generation Computer Systems
Keywords
Field
DocType
Parallel fuzzy clustering,Fuzzy clustering,Fuzzy minimals
Fuzzy clustering,Data mining,Data set,Computer science,Fuzzy logic,Internet of Things,Immediacy,Throughput,Cluster analysis,Dark data,Distributed computing
Journal
Volume
ISSN
Citations 
106
0167-739X
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Juan Manuel Cebrian12410.19
Baldomero Imbernon2126.48
Jesús A. Soto3132.67
Jose M. García4273.90
José M. Cecilia516622.28