Title | ||
---|---|---|
Compiler-Based Data Prefetching and Streaming Non-temporal Store Generation for the Intel(R) Xeon Phi(TM) Coprocessor |
Abstract | ||
---|---|---|
The Intel(R) Xeon Phi(TM) coprocessor has software prefetching instructions to hide memory latencies and special store instructions to save bandwidth on streaming non-temporal store operations. In this work, we provide details on compiler-based generation of these instructions and evaluate their impact on the performance of the Intel(R) Xeon Phi(TM) coprocessor using a wide range of parallel applications with different characteristics. Our results show that the Intel(R) Composer XE 2013 compiler can make effective use of these mechanisms to achieve significant performance improvements. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/IPDPSW.2013.231 | IPDPS Workshops |
Keywords | Field | DocType |
composer xe,memory latency,parallel application,compiler-based generation,xeon phi,streaming non-temporal store generation,special store instruction,different characteristic,significant performance improvement,non-temporal store operation,compiler-based data prefetching,effective use,intel xeon phi,vectors,parallel processing,performance,coprocessor,coprocessors,hardware,compiler,bandwidth | Xeon Phi,Computer science,Parallel processing,Parallel computing,Compiler,Bandwidth (signal processing),Software prefetching,Xeon,Instruction prefetch,Coprocessor,Operating system | Conference |
Citations | PageRank | References |
22 | 1.31 | 13 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Rakesh Krishnaiyer | 1 | 174 | 19.65 |
Emre Kultursay | 2 | 265 | 11.46 |
Pankaj Chawla | 3 | 22 | 1.31 |
Serguei Preis | 4 | 28 | 1.86 |
Anatoly Zvezdin | 5 | 22 | 1.31 |
Hideki Saito | 6 | 177 | 14.88 |