Abstract | ||
---|---|---|
In order to obtain maximum performance, many applications require to extend parallelism from multi-threading to instruction-level (SIMD) parallelism that exists in many current (and future) multi-core architectures. While auto-vectorization technology has been used to exploit this SIMD level, it is not always enough due to OpenMP semantics and compiler technology limitations. In those cases, programmers need to resort to low-level intrinsics or vendor specific directives. We propose a new OpenMP directive: the simd directive. This directive will allow programmers to guide the vectorization process enabling a more productive and portable exploitation of the SIMD level. Our performance results show significant improvements over current auto-vectorizing technology of the Intel® Composer XE 2011. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1007/978-3-642-30961-8_5 | IWOMP |
Keywords | Field | DocType |
simd architecture,openmp semantics,simd level,new openmp directive,auto-vectorization technology,vector construct,extending openmp,performance result,simd directive,vendor specific directive,compiler technology limitation,current auto-vectorizing technology,maximum performance,modern multicore,compilers,programming languages,interpreters | Computer architecture,Euclidean vector,Computer science,Parallel computing,SIMD,Directive,Vectorization (mathematics),Exploit,Compiler,Intrinsics,Multi-core processor | Conference |
Citations | PageRank | References |
10 | 0.93 | 12 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Michael Klemm | 1 | 160 | 15.52 |
Alejandro Duran | 2 | 943 | 61.43 |
Xinmin Tian | 3 | 596 | 52.92 |
Hideki Saito | 4 | 177 | 14.88 |
Diego Caballero | 5 | 20 | 2.51 |
Xavier Martorell | 6 | 1470 | 125.40 |