Title
Performance modeling of 3D MPDATA simulations on GPU cluster.
Abstract
The goal of this study is to parallelize the multidimensional positive definite advection transport algorithm (MPDATA) across a computational cluster equipped with GPUs. Our approach permits us to provide an extensive overlapping GPU computations and data transfers, both between computational nodes, as well as between the GPU accelerator and CPU host within a node. For this aim, we decompose a computational domain into two unequal parts which correspond to either data dependent or data independent parts. Then, data transfers can be performed simultaneously with computations corresponding to the second part. Our approach allows for achieving 16.372 Tflop/s using 136 GPUs. To estimate the scalability of the proposed approach, a performance model dedicated to MPDATA simulations is developed. We focus on the analysis of computation and communication execution times, as well as the influence of overlapping data transfers and GPU computations, with regard to the number of nodes.
Year
DOI
Venue
2017
10.1007/s11227-016-1774-z
The Journal of Supercomputing
Keywords
Field
DocType
EULAG,MPDATA,Stencils,GPU cluster,MPI,Performance model
GPU cluster,Computer science,Parallel computing,Data dependent,Positive-definite matrix,Computational science,Performance model,Scalability,Computation
Journal
Volume
Issue
ISSN
73
2
0920-8542
Citations 
PageRank 
References 
3
0.39
15
Authors
2
Name
Order
Citations
PageRank
Krzysztof Rojek1869.02
Roman Wyrzykowski272190.65