Title
Implementation and performance evaluation of a communication-avoiding GMRES method for stencil-based code on GPU cluster.
Abstract
In this study, a communication-avoiding generalized minimum residual method (CA-GMRES) is implemented on a hybrid CPU–GPU cluster, targeted for the performance acceleration of iterative linear system solver in the gyrokinetic toroidal five-dimensional Eulerian code (GT5D). In the GT5D, its sparse matrix-vector multiplication operation (SpMV) is performed as a 17-point stencil-based computation. The specialized part for the GT5D is only in the SpMV, and the other parts are usable also for other application program codes. In addition to the CA-GMRES, we implement and evaluate a modified variant of CA-GMRES (M-CA-GMRES) proposed in the previous study Idomura et al. (in: Proceedings of the 8th workshop on latest advances in scalable algorithms for large-scale systems (ScalA ’17), 2017. https://doi.org/10.1145/3148226.3148234) to reduce the amount of floating-point calculations. This study demonstrates that beneficial features of the CA-GMRES are in its minimum number of collective communications and its highly efficient calculations based on dense matrix–matrix operations. The performance evaluation is conducted on the Reedbush-L GPU cluster, which contains four NVIDIA Tesla P100 (Pascal GP100) GPUs per compute node. The evaluation results show that the M-CA-GMRES or CA-GMRES for the GT5D is advantageous over the GMRES or the generalized conjugate residual method (GCR) on GPU clusters, especially when the problem size (vector length) is large so that the cost of the SpMV is less dominant. The M-CA-GMRES is 1.09 ×, 1.22 × and 1.50 × faster than the CA-GMRES, GCR and GMRES, respectively, when 64 GPUs are used.
Year
DOI
Venue
2019
10.1007/s11227-019-02983-7
The Journal of Supercomputing
Keywords
Field
DocType
Communication-avoiding Krylov subspace method, GMRES, Tall-skinny QR factorization, Stencil-based computation, GPU cluster
Residual,Euclidean vector,GPU cluster,Generalized minimal residual method,Computer science,Stencil,Parallel computing,Multiplication,Solver,Conjugate residual method
Journal
Volume
Issue
ISSN
75
12
0920-8542
Citations 
PageRank 
References 
1
0.40
0
Authors
5
Name
Order
Citations
PageRank
Kazuya Matsumoto1556.37
Yasuhiro Idomura25714.27
Takuya Ina332.88
Akie Mayumi431.52
Susumu Yamada5369.54