Title
Re-evaluation of Atomic Operations and Graph Coloring for Unstructured Finite Volume GPU Simulations
Abstract
In general, race condition can be resolved by introducing synchronisations or breaking data dependencies. Atomic operations and graph coloring are the two typical approaches to avoid race condition. Graph coloring algorithms have been generally considered winning algorithms in the literature due to their lock free implementations. In this paper, we present the GPU-accelerated algorithms of the unstructured cell-centered finite volume Computational Fluid Dynamics (CFD) software framework named PHengLEI which was originally developed for aerodynamics applications with arbitrary hybrid meshes. Overall, the newly developed GPU framework demonstrate up to 4.8 speedup comparing with 18 MPI tasks run on the latest Intel CPU node. Furthermore, the enormous efforts have been invested to optimize data dependencies which could lead to race condition due to unstructured mesh indirect addressing and related reduction math operations. With careful comparison between our optimised graph coloring and atomic operations using a series of numerical tests with different mesh sizes, the results show that atomic operations are more efficient than our optimised graph coloring in all of the test cases on Nvidia Tesla GPU V100. Specifically, for the summation operation, using atomicAdd is twice as fast as graph coloring. For the maximum operation, a speedup of 1.5 to 2 is found for atomicMax vs. graph coloring.
Year
DOI
Venue
2020
10.1109/SBAC-PAD49847.2020.00048
2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)
Keywords
DocType
ISSN
Cell Centred Finite Volume,Unstructured Mesh,Atomics,Graph Coloring,GPU,CUDA,CFD Aerodynamics
Conference
1550-6533
ISBN
Citations 
PageRank 
978-1-7281-9925-2
0
0.34
References 
Authors
5
6
Name
Order
Citations
PageRank
Xi Zhang100.34
Xu Sun200.34
Xiaohu Guo355943.85
Yunfei Du47214.62
Yutong Lu530753.61
Yang Liu61568126.97