Title
Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch
Abstract
In this paper, we introduce a concept called algorithmic prefetching, for exploiting some of the features of the IBM RISC System/6000® computer. Algorithmic prefetching denotes changing algorithm A to algorithm B, which contains additional steps to move data from slower levels of memory to faster levels, with the aim that algorithm B outperform algorithm A. The objective of algorithmic prefetching is to minimize any penalty due to cache misses in the innermost loop of an algorithm. This concept, along with “cache blocking,” can be exploited to improve the performance of linear algebra algorithms for dense matrices. We experimentally demonstrated the impact of prefetching on two dense-matrix operations. For one operation, the performance was improved from 74% of peak to 89% of peak by algorithmic prefetching; for the second operation, it was improved from 73% to 87% of the peak performance.
Year
DOI
Venue
1994
10.1147/rd.383.0265
IBM Journal of Research and Development
Keywords
Field
DocType
dense matrix,improving performance,linear algebra algorithm,algorithmic prefetch,linear algebra
Linear algebra,Loop nest optimization,IBM,Cache,Matrix (mathematics),Computer science,Parallel computing,Algorithm,Real-time computing,Instruction prefetch
Journal
Volume
Issue
ISSN
38
3
0018-8646
Citations 
PageRank 
References 
22
8.51
3
Authors
3
Name
Order
Citations
PageRank
Agarwal, Ramesh C.13311.48
Gustavson, F.G.223042.78
Zubair, M.37421.50