Abstract |
---|
Spatial computing devices have been shown to significantly accelerate stencil computations, but have so far relied on unrolling the iterative dimension of a single stencil operation to increase temporal locality. This work considers the general case of mapping directed acyclic graphs of heterogeneous stencil computations to spatial computing systems, assuming large input programs without an iterative component. StencilFlow maximizes temporal locality and ensures deadlock freedom in this setting, providing end-to-end analysis and mapping from a high-level program description to distributed hardware. We evaluate our generated architectures on a Stratix 10 FPGA testbed, yielding 1.31 TOp/s and 4.18 TOp/s on single-device and multi-device, respectively, demonstrating the highest performance recorded for stencil programs on FPGAs to date. We then leverage the framework to study a complex stencil program from a production weather simulation application. Our work enables productively targeting distributed spatial computing systems with large stencil programs, and offers insight into architecture characteristics required for their efficient execution in practice. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/CGO51591.2021.9370315 | 2021 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) |
Keywords | DocType | ISSN |
---|---|---|
iterative dimension, single stencil operation, temporal locality, heterogeneous stencil computations, input programs, iterative component, StencilFlow, end-to-end analysis, high-level program description, single-device, complex stencil program, distributed spatial computing systems, spatial computing devices, directed acyclic graph mapping, large stencil program mapping, distributed hardware, Stratix 10 FPGA testbed, production weather simulation application, architecture characteristics | Conference | 2164-2397 |
ISBN | Citations | PageRank |
---|---|---|
978-1-7281-8614-6 | 0 | 0.34 |
References | Authors |
---|---|
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Johannes de Fine Licht | 1 | 9 | 4.43 |
Andreas Kuster | 2 | 0 | 0.34 |
Tiziano De Matteis | 3 | 3 | 1.20 |
Tal Ben-Nun | 4 | 116 | 14.21 |
Dominic Hofer | 5 | 0 | 0.34 |
Torsten Hoefler | 6 | 2197 | 163.64 |