Title
Mapping Imperfect Loops to Coarse-Grained Reconfigurable Architectures.
Abstract
Nested loops represent a significant portion of application runtime in multimedia and DSP applications, an important domain of applications for coarse-grained reconfigurable architectures (CGRAs). While conventional approaches to mapping nested loops utilize only a single-dimensional pipelining, which is either along the innermost loop or along an outer loop, in this paper, we explore an orthogonal approach of pipelining along multiple loop dimensions by first flattening the loop nest. To remedy the inevitable problem of repetitive outer-loop computation in flattened loops, we present a small set of special operations that can effectively reduce the number and frequency of micro-operations in the pipelined loop. We also present a loop transformation technique that can make our special operations applicable to a broader range of loops, including those with triangular iteration spaces. Our experimental results using imperfect loops from StreamIt benchmarks demonstrate that our special operations can cover a large portion of operations in flattened loops, improve performance of nested loops by nearly 30% over using loop flattening only, and achieve near-ideal executions on CGRAs for imperfect loops.
Year
DOI
Venue
2016
10.1109/TCAD.2015.2504918
IEEE Trans. on CAD of Integrated Circuits and Systems
Keywords
Field
DocType
Arrays,Pipeline processing,Kernel,Acceleration,Hardware
Loop fusion,Pipeline (computing),Computer science,Parallel computing,Loop fission,Loop splitting,Loop inversion,Real-time computing,Loop tiling,Loop interchange,Nested loop join
Journal
Volume
Issue
ISSN
35
7
0278-0070
Citations 
PageRank 
References 
0
0.34
25
Authors
4
Name
Order
Citations
PageRank
Hyeon Uk Sim1315.19
Hongsik Lee2101.52
Seongseok Seo321.04
Jongeun Lee442933.71