Title
Finding exemplars in dense data with affinity propagation on clusters of GPUs.
Abstract
This work presents an efficient implementation of affinity propagation (AP) on clusters of graphical processing units (GPUs). AP is a state-of-the-art method for finding exemplars in data sets described by similarity matrices. It is typically employed in crisp clustering applications. However, when finding exemplars in an n-pattern data set with dense, non-metric similarities, AP performs iterative processing of three nxn floating point matrices. One of them stores the similarities, and the other two store the values that will ultimately pinpoint the exemplars. For large similarity matrices, AP is therefore computationally expensive. Although matrix operations of AP are well suited for GPUs, its memory footprint limits the size of tasks that can be solved on one unit. We present, however, a decomposition scheme for AP that distributes the calculations over multiple GPUs, with low communication-to-computation ratio. Because of this favorable communication pattern, our implementation finds exemplars in large, dense similarity data efficiently, even when GPUs are connected by a slow network. Furthermore, by combining global device memory of multiple GPUs, it can solve problems that would not fit in a single unit. Copyright (c) 2012 John Wiley & Sons, Ltd.
Year
DOI
Venue
2013
10.1002/cpe.2962
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE
Keywords
DocType
Volume
multiple-GPU implementation,affinity propagation,cluster analysis,OpenCL
Journal
25
Issue
ISSN
Citations 
SP8
1532-0626
2
PageRank 
References 
Authors
0.40
7
2
Name
Order
Citations
PageRank
Marcin Kurdziel1558.33
Krzysztof Boryczko24211.27