Title
Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems
Abstract
With the raw computing power of graphics processing units (GPUs) being more widely available in commodity multicore systems, there is an imminent need to harness their power for important numerical libraries such as LAPACK. In this paper, we consider the solution of dense symmetric and Hermitian eigenproblems by the LAPACK divide and conquer algorithm on such modern heterogeneous systems. We focus on how to make the best use of the individual strengths of the massively parallel manycore GPUs and multicore CPUs. The resulting algorithm overcomes performance bottlenecks faced by current implementations that are optimized for a homogeneous multicore. On a dual socket quad-core Intel Xeon 2.33 GHz with an NVIDIA GTX 280 GPU, we typically obtain up to about a tenfold improvement in performance for the complete dense problem. The techniques described here thus represent an example of how to develop numerical software to efficiently use heterogeneous architectures. As heterogeneity becomes more common in the architecture design, the significance of and need for this work are expected to grow.
Year
DOI
Venue
2012
10.1137/100806783
SIAM J. Scientific Computing
Keywords
Field
DocType
hybrid gpu-accelerated multicore systems,lapack divide,commodity multicore system,homogeneous multicore,dense symmetric,imminent need,heterogeneous architecture,important numerical library,best use,multicore cpus,complete dense problem,multicore,heterogeneous computing,performance
Graphics,Massively parallel,Computer science,Parallel computing,Symmetric multiprocessor system,Implementation,Xeon,Divide and conquer algorithms,Multi-core processor,Multicore systems
Journal
Volume
Issue
ISSN
34
2
1064-8275
Citations 
PageRank 
References 
15
0.90
21
Authors
3
Name
Order
Citations
PageRank
Christof Vömel116817.80
Stanimire Tomov21214102.02
Jack J. Dongarra3176252615.79