Title
Influence Of Architectural Features Of The SNC-4 Mode Of The Intel Xeon Phi KNL On Matrix Multiplication
Abstract
The Sub-NUMA Clustering 4 (SNC-4) affinity mode of the Intel Xeon Phi Knights Landing introduces a new environment for parallel applications that provides a NUMA system on a single chip. The main goal of this work is to characterize the behaviour of this system, focusing on nested parallelization of a well-known algorithm with regular and predictable memory access patterns: matrix multiplication. The study covers the effects of thread distribution across the processor on performance under the SNC-4 affinity mode, the differences between the cache and flat modes of the MCDRAM, and the improvements due to vectorization in scenarios with different degrees of data locality. Results show that the best thread placement is the scatter distribution, using 64 or 128 threads. Differences between the cache and flat modes of the MCDRAM are generally not significant. Optimization techniques such as padding to improve locality have a great impact on execution times. Vectorization proved efficient only when data locality is good, especially when the MCDRAM is used as a cache.
Year
2019
DOI
10.1007/978-3-030-22750-0_41
Venue
COMPUTATIONAL SCIENCE - ICCS 2019, PT V
Keywords
Intel Xeon Phi KNL, SNC-4, MCDRAM, Vectorization
Field
MCDRAM, Computer science, Xeon Phi, Parallel computing, Vectorization (mathematics), Chip, Cluster analysis, Matrix multiplication, Distributed computing
DocType
Conference
Volume
11540
ISSN
0302-9743
Citations
0
PageRank
0.34
References
0
Authors
3
Name, Order, Citations, PageRank
Ruben Laso, 1, 0, 0.34
F. F. Rivera, 2, 90, 15.12
José C. Cabaleiro, 3, 10, 3.44