Title | ||
---|---|---|
GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU |
Abstract | ||
---|---|---|
Recent studies have shown promising performance benefits when multiple stages of a pipelined stencil application are mapped to different parts of a GPU to run concurrently. An important factor for the computing efficiency of such pipelines is the granularity of a task. In previous programming frameworks that support true pipelined computations on GPU, the choice has to be made by the programmers during the application development time. Due to many difficulties, programmers' decisions are often far from optimal, causing inferior performance and performance portability.
This paper presents GOPipe, a granularity-oblivious programming framework for efficient pipelined stencil executions on GPU. With GOPipe, programmers no longer need to specify the appropriate task granularity. GOPipe automatically finds it, and dynamically schedules tasks of that granularity for efficiency while observing all inter-task and inter-stage data dependencies. In our experiments on six real-life applications and various scenarios, GOPipe outperforms the state-of-the-art system by 1.39X on average with a much better programming productivity.
|
Year | DOI | Venue |
---|---|---|
2020 | 10.1145/3410463.3414656 | PACT '20: International Conference on Parallel Architectures and Compilation Techniques
Virtual Event
GA
USA
October, 2020 |
DocType | ISBN | Citations |
Conference | 978-1-4503-8075-1 | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chanyoung Oh | 1 | 12 | 4.65 |
Zhen Zheng | 2 | 7 | 1.93 |
Xipeng Shen | 3 | 2025 | 118.55 |
Jidong Zhai | 4 | 340 | 36.27 |
Youngmin Yi | 5 | 281 | 25.93 |