Title |
---|
A method for communication efficient work distributions in stencil operation based applications on heterogeneous clusters |
Abstract |
---|
In recent years, the use of accelerators in conjunction with CPUs, known as heterogeneous computing, has brought about significant performance increases for scientific applications. One of the best examples of this is Lattice Quantum Chromo-Dynamics (QCD), a stencil operation based simulation. These simulations have a large memory footprint necessitating the use of many graphics processing units (GPUs) in parallel. This requires the use of a heterogeneous cluster with one or more GPUs per node. In order to obtain optimal performance, it is necessary to determine an efficient communication pattern between GPUs on the same node and between nodes. In this paper we present a performance model based method for minimizing the communication time of applications with stencil operations, such as Lattice QCD, on heterogeneous computing systems with a non-blocking Infiniband interconnection network. The proposed method is able to increase the performance of the most computationally intensive kernel of Lattice QCD by 25 percent due to improved overlapping of communication and computation. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/HPCSim.2012.6266960 | High Performance Computing and Simulation |
Keywords | Field | DocType |
graphics processing units,simulation,CPU,GPU,QCD,communication pattern,heterogeneous clusters,heterogeneous computing,infiniband interconnection network,lattice quantum chromo-dynamics,stencil operation based simulation,work distributions,GPU acceleration,Lattice QCD,nearest neighbor,performance model,stencil operation | InfiniBand,Computer science,Parallel computing,Stencil,Symmetric multiprocessor system,Bandwidth (signal processing),Lattice QCD,Graphics processing unit,Memory footprint,Benchmark (computing) | Conference |
ISBN | Citations | PageRank |
978-1-4673-2359-8 | 0 | 0.34 |
References | Authors | |
5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
J. Schneible | 1 | 0 | 0.34 |
Lubomir Riha | 2 | 39 | 14.31 |
Malik Magdon-Ismail | 3 | 914 | 104.34 |
Tarek El-Ghazawi | 4 | 697 | 84.30 |