Title
An Auto-Tuning With Adaptation Of A64 Scalable Vector Extension For Spiral
Abstract
In this paper, we propose an auto-tuning (AT) system by adapting the A64 Scalable Vector Extension for SPIRAL to generate discrete Fourier transform (DFT) implementations. The performance of our method is evaluated using the Supercomputer "Flow" at Nagoya University. The A64 scalable vector extension applied DFT codes are up to 1.98 times faster than scalar DFT codes and up to 3.63 times higher in terms of the SIMD instruction rate. In addition, we obtain a factor of maximum speedup 2.32 by adapting proposed AT system for loop unrolling.
Year
DOI
Venue
2021
10.1109/IPDPSW52791.2021.00117
2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW)
Keywords
DocType
Citations 
Auto-tuning, Discrete Fourier Transform, SPIRAL, SIMD, A64 SVE
Conference
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Naruya Kitai100.34
Daisuke Takahashi229739.92
Franz Franchetti397488.39
Takahiro Katagiri400.34
Satoshi Ohshima500.34
Toru Nagai602.03