Recursion leads to automatic variable blocking for dense linear-algebra algorithms | 128 | 15.72 | 1997 |
Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch | 22 | 8.51 | 1994 |
Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms | 52 | 12.99 | 1994 |
A high performance algorithm using pre-processing for the sparse matrix-vector multiplication | 28 | 5.56 | 1992 |