Abstract | ||
---|---|---|
Accelerators can offer exceptional performance advantages. However, programmers need to spend considerable efforts on acceleration, without knowing how sustainable the employed programming models, languages and tools are. To tackle this challenge, we propose and demonstrate a new runtime system called HTrOP that is able to automatically generate and execute OpenCL code from sequential CPU code. HTrOP transforms suitable data-parallel loops into independent OpenCL-typical work-items and handles concrete calls to these devices through a mix of library components and application-specific OpenCL host code. Computational hotspots are identified and can be offloaded to different resources (CPU, GPGPU and Xeon Phi). We demonstrate the potential of HTrOP on a broad set of applications and are able to improve the performance by 4.3X on average.
|
Year | DOI | Venue |
---|---|---|
2018 | 10.1145/3178487.3178534 | PPOPP |
Keywords | Field | DocType |
LLVM, multi-accelerator, openCL, runtime system, transparent acceleration | Programming paradigm,Xeon Phi,Computer science,Parallel computing,Acceleration,General-purpose computing on graphics processing units,Runtime system | Conference |
Volume | Issue | ISSN |
53 | 1 | 0362-1340 |
ISBN | Citations | PageRank |
978-1-4503-4982-6 | 0 | 0.34 |
References | Authors | |
2 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Heinrich Riebler | 1 | 13 | 3.58 |
Gavin Vaz | 2 | 12 | 2.85 |
Tobias Kenter | 3 | 13 | 6.07 |
Christian Plessl | 4 | 297 | 35.98 |