Abstract | ||
---|---|---|
Solving a banded linear system efficiently is important to many scientific and engineering applications. Current solvers achieve good scalability only on the linear systems that can be partitioned into independent subsystems. In this paper, we present a GPU based, scalable Bi-Conjugate Gradient Stabilized solver that can be used to solve a wide range of banded linear systems. We utilize a row-oriented matrix decomposition method to divide the banded linear system into several correlated sublinear systems and solve them on multiple GPUs collaboratively. We design a number of GPU and MPI optimizations to speedup inter-GPU and inter-machine communications. We evaluate the solver on Poisson equation and advection diffusion equation as well as several other banded linear systems. the slover achieves a speedup of more than 21 times running from 6 to 192 GPUs on the XSEDE's Keeneland supercomputer and because of small communication overhead, can scale upto 32 GPUs on Amazon EC2 with relatively slow ethernet network. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/CLUSTER.2013.6702612 | 2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER) |
Keywords | Field | DocType |
poisson equation,local area networks,linear systems | Poisson's equation,Linear system,Supercomputer,CUDA,Computer science,Parallel computing,Matrix decomposition,Computational science,Solver,Scalability,Speedup | Conference |
ISSN | Citations | PageRank |
1552-5244 | 3 | 0.38 |
References | Authors | |
18 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hang Liu | 1 | 835 | 90.79 |
Jung Hee Seo | 2 | 60 | 5.57 |
Rajat Mittal | 3 | 170 | 17.59 |
H. Howie Huang | 4 | 537 | 40.29 |