Title
Matrix-based streamization approach for improving locality and parallelism on FT64 stream processor
Abstract
FT64 is the first 64-bit stream processor designed for scientific computing. It is critical to exploit optimizing streamization approaches for scientific applications on FT64 due to the inefficiency of direct streamization approach. In this paper, we propose a novel matrix-based streamization approach for improving locality and parallelism of scientific applications on FT64. First, a Data&Computation Matrix is built to abstract the relationship between loops and arrays of the original programs, and it is helpful for formulating the streamization problem. Second, three key techniques for optimizing streamization approach are proposed based on the transformations of the matrix, i.e., coarse-grained program transformations, fine-grained program transformations, and stream organization optimizations. Finally, we apply our approach to ten typical scientific application kernels on FT64. The experimental results show that the matrix-based streamization approach achieves an average speedup of 2.76 over the direct streamization approach, and performs equally to or better than the corresponding Fortran programs on Itanium 2 except CG. It is certain that the matrix-based streamization approach is a promising and practical solution to efficiently exploit the tremendous potential of FT64.
Year
DOI
Venue
2009
10.1007/s11227-008-0186-0
The Journal of Supercomputing
Keywords
Field
DocType
FT64,D&C Matrix,Streamization,Program transformation,Stream organization
Locality,Matrix calculus,Program transformation,Computer science,Matrix (mathematics),Parallel computing,Itanium,Fortran,Stream processing,Distributed computing,Speedup
Journal
Volume
Issue
ISSN
47
2
0920-8542
Citations 
PageRank 
References 
5
0.41
25
Authors
4
Name
Order
Citations
PageRank
Xuejun Yang1122.29
Jing Du2378.95
Xiaobo Yan3316.01
Yu Deng4245.13