Title
Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi
Abstract
NAS parallel benchmarks (NPB) are a set of applications commonly used to evaluate parallel systems. We use the NPB-OpenMP version to examine the performance of the Intel's new Xeon Phi co-processor and focus in particular on the many core aspect of the Xeon Phi architecture. A first analysis studies the scalability up to 244 threads on 61 cores and the impact of affinity settings on scaling. It also compares performance characteristics of Xeon Phi and traditional Xeon CPUs. The application of several well-established optimization techniques allows us to identify common bottlenecks that can specifically impede performance on the Xeon Phi but are not as severe on multi-core CPUs. We also find that many of the OpenMP-parallel loops are too short (in terms of the number of loop iterations) for a balanced execution by 244 threads. New or redesigned benchmarks will be needed to accommodate the greatly increased number of cores and threads. At the end, we summarize our findings in a set recommendations for performance optimization for Xeon Phi.
Year
DOI
Venue
2013
10.1109/ICPP.2013.87
Parallel Processing
Keywords
Field
DocType
microprocessor chips,optimisation,parallel processing,performance evaluation,Intel new Xeon Phi coprocessor,NAS parallel benchmarks,NPB-OpenMP version,OpenMP parallel loops,Xeon Phi architecture,optimization techniques,parallel systems,performance evaluation,Multicore processing,Parallel programming,Performance analysis
Computer science,Instruction set,Xeon Phi,Parallel computing,Thread (computing),Hyper-threading,Xeon,Multi-core processor,Benchmark (computing),Scalability
Conference
ISSN
Citations 
PageRank 
0190-3918
20
0.89
References 
Authors
4
4
Name
Order
Citations
PageRank
Ramachandran, A.1200.89
Vienne, J.2200.89
Rob F. Van der Wijngaart337445.61
Koesterke, L.4200.89