Abstract | ||
---|---|---|
This paper presents the design and implementation of an Open Computing Language (OpenCL) framework for the Matrix-2000 many-core architecture. This architecture is designed to replace the Intel XeonPhi accelerators of the TianHe-2 supercomputer. We share our experience and insights on how to design an effective OpenCL system for this new hardware accelerator. We propose a set of new analysis and optimizations to unlock the potential of the hardware. We extensively evaluate our approach using a wide range of OpenCL benchmarks on a single and multiple computing nodes. We present our design choices and provide guidance how to optimize code on the new Matrix-2000 architecture. |
Year | Venue | Field |
---|---|---|
2018 | CF | Architecture,Computer architecture,Supercomputer,Matrix (mathematics),Computer science,Parallel computing,Symmetric multiprocessor system,Hardware acceleration |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
19 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Peng Zhang | 1 | 48 | 5.09 |
Tao Tang | 2 | 42 | 7.44 |
Jianbin Fang | 3 | 265 | 25.31 |
Chun Huang | 4 | 13 | 8.00 |
Canqun Yang | 5 | 188 | 29.39 |
Zheng Wang | 6 | 215 | 18.10 |