Title
Improved strong scaling of a spectral/finite difference gyrokinetic code for multi-scale plasma turbulence
Abstract
Optimization techniques of a 5D plasma turbulence simulation code are presented.Segmented MPI-process mapping on 3D torus networks reduces communication costs.Pipelined computation-communication overlaps effectively mask communication costs.The code achieves strong excellent strong scaling up to 600k cores.Cross-scale interactions of multi-scale plasma turbulence is demonstrated. Optimization techniques of a plasma turbulence simulation code GKV for improved strong scaling are presented. This work is motivated by multi-scale plasma turbulence extending over multiple spatio-temporal scales of electrons and ions, whose simulations based on the gyrokinetic theory require huge calculations of five-dimensional (5D) computational fluid dynamics by means of spectral and finite difference methods. First, we present the multi-layer domain decomposition of the multi-dimensional and multi-species problem, and segmented MPI-process mapping on 3D torus interconnects, which fully utilizes the bi-section bandwidth for data transpose and reduces the conflicts of simultaneous point-to-point communications. These techniques reduce the inter-node communication cost drastically. Second, pipelined computation-communication overlaps are implemented by using the OpenMP/MPI hybrid parallelization, which effectively mask the communication cost. Careful regulations of the pipeline length and of the thread granularity respectively suppress latencies of MPI, load imbalance and scheduling overheads of OpenMP. Thanks to the above optimizations, GKV achieves excellent strong scaling up to ~600k cores with high computational performance 782.4¿TFlops (8.29% of the theoretical peak) and high effective parallelization rate ~99.99994% on K, which demonstrates its applicability and efficiency toward a million of cores. The optimized code realizes multi-scale plasma turbulence simulations covering electron and ion scales, and reveals cross-scale interactions of electron- and ion-scale turbulence.
Year
DOI
Venue
2015
10.1016/j.parco.2015.06.001
Parallel Computing
Keywords
Field
DocType
Gyrokinetics,Computational fluid dynamics application,Parallel scalability,Spectral method,Finite difference
Finite difference,Computer science,Parallel computing,Theoretical computer science,Thread (computing),Bandwidth (signal processing),Finite difference method,Granularity,Scaling,Domain decomposition methods,Gyrokinetics
Journal
Volume
Issue
ISSN
49
C
0167-8191
Citations 
PageRank 
References 
2
0.48
10
Authors
6
Name
Order
Citations
PageRank
Shinya Maeyama120.82
Tomohiko Watanabe2163.00
Yasuhiro Idomura35714.27
Motoki Nakata4142.61
Masanori Nunami5164.25
A. Ishizawa661.62