Title | ||
---|---|---|
Performance evaluation and analysis of thread pinning strategies on multi-core platforms: Case study of SPEC OMP applications on intel architectures |
Abstract | ||
---|---|---|
With the introduction of multi-core processors, thread affinity has quickly appeared to be one of the most important factors to accelerate program execution times. The current article presents a complete experimental study on the performance of various thread pinning strategies. We investigate four application independent thread pinning strategies and five application sensitive ones based on cache sharing. We made extensive performance evaluation on three different multi-core machines reflecting three usual utilisation: workstation machine, server machine and high performance machine. In overall, we show that fixing thread affinities (whatever the tested strategy) is a better choice for improving program performance on HPC ccNUMA machines compared to OS-based thread placement. This means that the current Linux OS scheduling strategy is not necessarily the best choice in terms of performance on ccNUMA machines, even if it is a good choice in terms of cores usage ratio and work balancing. On smaller Core2 and Nehalem machines, we show that the benefit of thread pinning is not satisfactory in terms of speedups versus OS based scheduling, but the performance stability is much better. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1109/HPCSim.2011.5999834 | High Performance Computing and Simulation |
Keywords | Field | DocType |
Linux,cache storage,microprocessor chips,multiprocessing systems,performance evaluation,scheduling,Core2 machines,HPC ccNUMA machine,Intel architectures,Linux OS scheduling strategy,Nehalem machines,OS based thread placement,SPEC OMP applications,cache sharing,cores usage ratio,high performance machine,multicore processors,performance evaluation,program execution times,server machine,thread affinity,thread pinning strategies,work balancing,workstation machine,Multi-Cores,OpenMP,Operating Systems,Thread Affinity,Thread Level Parallelism | Instruction set,Cache,Scheduling (computing),Computer science,Task parallelism,Parallel computing,Workstation,Thread (computing),Processor affinity,Multi-core processor,Operating system | Conference |
ISBN | Citations | PageRank |
978-1-61284-380-3 | 13 | 0.82 |
References | Authors | |
8 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Abdelhafid Mazouz | 1 | 51 | 5.13 |
Sid-Ahmed-Ali Touati | 2 | 91 | 12.27 |
Denis Barthou | 3 | 238 | 26.14 |