Title
Extending OpenMP* with vector constructs for modern multicore SIMD architectures
Abstract
To obtain maximum performance, many applications need to extend parallelism from multi-threading to the instruction-level (SIMD) parallelism available in many current (and future) multi-core architectures. While auto-vectorization technology has been used to exploit this SIMD level, it is not always enough because of OpenMP semantics and the limitations of compiler technology. In those cases, programmers must resort to low-level intrinsics or vendor-specific directives. We propose a new OpenMP directive: the simd directive. This directive allows programmers to guide the vectorization process, enabling more productive and portable exploitation of the SIMD level. Our performance results show significant improvements over the current auto-vectorizing technology of the Intel® Composer XE 2011.
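To illustrate the kind of loop annotation the abstract describes, here is a minimal sketch in C. It assumes the `#pragma omp simd` spelling later standardized in OpenMP 4.0; the syntax and clauses proposed in the paper itself may differ. The function name `saxpy` and its parameters are hypothetical and are not taken from the paper.

```c
#include <stddef.h>

/* Hypothetical helper (not from the paper): computes y[i] = a * x[i] + y[i].
   The pragma tells the compiler that the loop iterations may be executed
   in SIMD fashion, so it does not have to prove on its own that x and y
   do not overlap. The caller is responsible for that guarantee. */
void saxpy(size_t n, float a, const float *x, float *y)
{
    #pragma omp simd
    for (size_t i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

Without OpenMP support enabled, a compiler simply ignores the pragma and the loop runs as ordinary scalar code, which is part of what makes a directive-based approach more portable than intrinsics.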
Year
2012
DOI
10.1007/978-3-642-30961-8_5
Venue
IWOMP
Keywords
simd architecture, openmp semantics, simd level, new openmp directive, auto-vectorization technology, vector construct, extending openmp, performance result, simd directive, vendor specific directive, compiler technology limitation, current auto-vectorizing technology, maximum performance, modern multicore, compilers, programming languages, interpreters
Field
Computer architecture, Euclidean vector, Computer science, Parallel computing, SIMD, Directive, Vectorization (mathematics), Exploit, Compiler, Intrinsics, Multi-core processor
DocType
Conference
Citations
10
PageRank
0.93
References
12
Authors
6
Name               Order  Citations  PageRank
Michael Klemm      1      160        15.52
Alejandro Duran    2      943        61.43
Xinmin Tian        3      596        52.92
Hideki Saito       4      177        14.88
Diego Caballero    5      20         2.51
Xavier Martorell   6      1470       125.40