Title
Exploiting parallelism of imperfect nested loops with sibling inner loops on coarse-grained reconfigurable architectures.
Abstract
Coarse-grained reconfigurable architecture (CGRA) is a promising platform for loop acceleration, but existing software pipelining methods cannot achieve satisfactory performance on a fair number of imperfect nested loops, especially those with sibling inner loops. To tackle this problem, this paper makes 2 contributions: 1) a 2-level pipelining method with an effective II optimization strategy for the imperfect loops with sibling inner loops; 2) a novel kernel compression method to reduce oversize kernel. Experiment results show that our approach can achieve much higher performance than the state-of-the-art approaches at acceptable costs.
Year
DOI
Venue
2016
10.1109/ASPDAC.2016.7428054
ASP-DAC
Keywords
Field
DocType
data compression,optimisation,parallel architectures,reconfigurable architectures,CGRA,II optimization strategy,coarse-grained reconfigurable architecture,kernel compression method,loop acceleration,loop parallelism
Kernel (linear algebra),Pipeline (computing),Imperfect,Software pipelining,Computer science,Parallel computing,Real-time computing,Acceleration,Nested loop join,For loop
Conference
ISSN
Citations 
PageRank 
2153-6961
0
0.34
References 
Authors
12
4
Name
Order
Citations
PageRank
Xinhan Lin1112.62
shouyi yin257999.95
leibo liu3816116.95
Shaojun Wei4555102.32