Title
Improving Memory Traffic by Assembly-Level Exploitation of Reuses for Vector Registers
Abstract
In this paper, we propose a compilation scheme to analyze and exploit the implicit reuses of vector register data. According to the reuse analysis, we present a translation strategy that translates the vectorized loops into assembly vector codes with exploitation of vector reuses. Experimental results show that our compilation technique can improve the execution time and traffic between shared memory and vector registers. Techniques discussed here are simple, systematic, and easy to be implemented in the conventional vector compilers or translators to enhance the data locality of vector registers.
Year
DOI
Venue
2000
10.1023/A:1008134522009
The Journal of Supercomputing
Keywords
Field
DocType
data dependence,vector register,partial reuse,vector compilers,reuse distance,vectorization,supercomputer
Locality,Shared memory,Supercomputer,Computer science,Reuse,Parallel computing,Vectorization (mathematics),Exploit,Compiler,Execution time
Journal
Volume
Issue
ISSN
17
2
1573-0484
Citations 
PageRank 
References 
1
0.43
10
Authors
3
Name
Order
Citations
PageRank
chihyung chang12510.85
Tzung-shi Chen256247.08
Jang-Ping Sheu34173451.70