Title
Loop Optimization using Hierarchical Compilation and Kernel Decomposition
Abstract
The increasing complexity of hardware features in recent processors makes high-performance code generation very challenging. In particular, several optimization targets have to be pursued simultaneously (minimizing L1/L2/L3/TLB misses and maximizing instruction-level parallelism). Very often, these optimization goals impose different and contradictory constraints on the transformations to be applied. We propose a new hierarchical compilation approach for the generation of high-performance code relying on the use of state-of-the-art compilers. This approach is not application-dependent and does not require any assembly hand-coding. It relies on the decomposition of the original loop nest into simpler kernels, typically 1D to 2D loops, which are much simpler to optimize. We successfully applied this approach to optimize dense matrix multiply primitives (not only for the square case but also for the more general rectangular cases) and convolution. The optimized codes on Itanium 2 and Pentium 4 architectures outperform ATLAS and, in most cases, match hand-tuned vendor libraries (e.g., MKL).
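To make the idea of kernel decomposition concrete, here is a minimal sketch, not the authors' implementation. It assumes a tiled matrix multiply whose innermost work is factored into a small 2D kernel (a rank-1 update over one tile), while an outer driver walks the tiles. The names matmul_naive, kernel_rank1, matmul_tiled and the tile parameter are hypothetical and chosen only for illustration. In the paper's setting such kernels would be compiled and tuned by an optimizing compiler; this sketch only shows the loop structure and checks that it handles the rectangular case.

```python
def matmul_naive(a, b):
    """Reference triple loop: c = a * b for an n x p and a p x m matrix."""
    n, p, m = len(a), len(b), len(b[0])
    c = [[0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            for k in range(p):
                c[i][j] += a[i][k] * b[k][j]
    return c


def kernel_rank1(a, b, c, i0, i1, j0, j1, k):
    """Hypothetical 2D kernel: rank-1 update of the tile c[i0:i1][j0:j1]
    using column k of a and row k of b."""
    row_b = b[k]
    for i in range(i0, i1):
        aik = a[i][k]
        row_c = c[i]
        for j in range(j0, j1):
            row_c[j] += aik * row_b[j]


def matmul_tiled(a, b, tile=2):
    """Outer driver: iterate over tiles of c and delegate the inner work
    to the small 2D kernel. 'tile' is an illustrative parameter."""
    n, p, m = len(a), len(b), len(b[0])
    c = [[0] * m for _ in range(n)]
    for i0 in range(0, n, tile):
        i1 = min(i0 + tile, n)  # clamp the last tile in a rectangular matrix
        for j0 in range(0, m, tile):
            j1 = min(j0 + tile, m)
            for k in range(p):
                kernel_rank1(a, b, c, i0, i1, j0, j1, k)
    return c
```

The decomposed version computes the same result as the reference loop nest; only the loop structure changes, which leaves the small kernel as the single place where low-level tuning would apply.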
Year
2007
DOI
10.1109/CGO.2007.22
Venue
CGO
Keywords
optimized code,assembly hand-coding,dense matrix multiply primitive,loop optimization,optimization target,recent processor,kernel decomposition,new hierarchical compilation approach,contradictory constraint,hierarchical compilation,high performance code,high performance code generation,simpler kernel,code generation,hardware,ATLAS,assembly,constraint optimization,kernel
Field
Kernel (linear algebra),Computer science,Parallel computing,Loop optimization,Itanium,Real-time computing,Compiler,Pentium,Translation lookaside buffer,Sparse matrix,Constrained optimization
DocType
Conference
ISSN
2164-2397
ISBN
0-7695-2764-7
Citations
4
PageRank
0.46
References
15
Authors
5
Name                 Order  Citations  PageRank
Denis Barthou        1      238        26.14
Sebastien Donadio    2      34         1.73
Patrick Carribault   3      240        25.96
Alexandre Duchateau  4      4          0.46
William Jalby        5      733        148.24