Title
Performance Analysis of SIMD Vectorization of High-Order Finite-Element Kernels
Abstract
Physics-based three-dimensional numerical simulations are becoming more predictive and are already essential for improving the understanding of natural phenomena, such as earthquakes, tsunami, flooding or climate change and global warming. Among the numerical methods available to support these simulations, Finite-Element formulations have been implemented in several major software packages. The efficiency of these algorithms remains a challenge due to the irregular memory access that prevents the squeezing out of the maximum level of performance out of current architectures. This is particularly true at the shared-memory level with several levels of parallelism and complex memory hierarchies. Despite significant efforts, automatic optimizations provided by compilers and high-level frameworks are often far from the performances obtained from hand-tuned implementations. In this paper, we have extracted a kernel from the EFISPEC software package developed at BRGM (the French Geological Survey). This application implements a high-order finite-element method to solve the elastodynamic equation. We characterize the performance of the extracted mini-app considering key parameters such as the order of the approximation, the memory access pattern or the vector length. Based on this study, we detail specific optimizations and we discuss the results measured as regards to the roofline performance model on Intel Broadwell and Skylake architectures.
Year
DOI
Venue
2018
10.1109/HPCS.2018.00074
2018 International Conference on High Performance Computing & Simulation (HPCS)
Keywords
Field
DocType
physics-based three-dimensional numerical simulations,natural phenomena,earthquakes,flooding,global warming,numerical methods,software packages,irregular memory access,shared-memory level,complex memory hierarchies,automatic optimizations,hand-tuned implementations,EFISPEC software package,French Geological Survey,high-order finite-element method,memory access pattern,vector length,roofline performance model,SIMD vectorization,finite-element formulations,high-order finite-element kernels
Kernel (linear algebra),Euclidean vector,Parallel computing,Vectorization (mathematics),SIMD,Compiler,Finite element method,Software,Geology,Intrinsics
Conference
ISBN
Citations 
PageRank 
978-1-5386-7880-0
0
0.34
References 
Authors
14
5
Name
Order
Citations
PageRank
Gauthier Sornet100.34
Sylvain Jubertie2255.70
Fabrice Dupros310011.40
Florent De Martin4101.68
Sébastien Limet516321.80