Title
Numerical Applications and Sub-Word Parallelism: The NAS Benchmarks on a Pentium 4
Abstract
We examine the impact of Pentium 4 SIMD instructions on the Fortran and C versions of the NAS benchmarks, using either compiler vectorization or inlined assembly code. Although few functions generally benefit from SIMD operations, those that use complex numbers or random number generators can be accelerated efficiently.
Year
2002
DOI
10.1109/HPCSA.2002.1019156
Venue
HPCS
Keywords
sub-word parallelism,c version,simd operation,compiler vectorization,complex number,simd instruction,nas benchmarks,numerical applications,random number generator,assembly code in-lining,assembly,registers,acceleration,benchmark testing,parallel processing,random number generation,profitability,random number generators
Field
Complex number,Computer science,Parallel computing,SIMD,Fortran,Vectorization (mathematics),Assembly language,Compiler,Pentium,Random number generation
DocType
Conference
ISBN
0-7695-1626-2
Citations
0
PageRank
0.34
References
2
Authors
1
Name
Daniel Etiemble
Order
1
Citations
300
PageRank
42.43