Title
HARP: Harnessing inactive threads in many-core processors
Abstract
SIMT accelerators are equipped with thousands of computational resources. Conventional accelerators, however, fail to fully utilize available resources due to branch and memory divergences. This underutilization is manifested in two underlying inefficiencies: pipeline width underutilization and pipeline depth underutilization. Width underutilization occurs when SIMD execution units are not entirely utilized due to branch divergences. This affects lane activity and results in SIMD inefficiency. Depth underutilization takes place when the pipeline runs out of active threads and is forced to leave pipeline stages idle. This work addresses both inefficiencies by harnessing inactive threads available to the pipeline. We introduce Harnessing inActive thReads in many-core Processors (or simply HARP) to improve width and depth utilization in accelerators. We show how using inactive yet ready threads can enhance performance. Moreover, we investigate implementation details and study microarchitectural changes needed to build a HARP-enhanced accelerator. Furthermore, we evaluate HARP under a variety of microarchitectural design points. We measure the area overhead associated with HARP and compare to conventional alternatives. Under Fermi-like GPUs, we show that HARP provides 10% speedup on average (maximum of 1.6X) at the cost of 3.5% area overhead. Our analysis shows that HARP performs better under narrower SIMD and shorter pipelines.
Year
DOI
Venue
2014
10.1145/2567938
ACM Trans. Embedded Comput. Syst.
Keywords
Field
DocType
simd inefficiency,pipeline width underutilization,shorter pipeline,width underutilization,depth underutilization,simd execution unit,inactive thread,pipeline depth underutilization,many-core processor,pipeline stage,narrower simd,depth utilization,multithreading
Multithreading,Pipeline transport,Computer science,Parallel computing,SIMD,Real-time computing,Thread (computing),Branch divergence,HARP,Embedded system,Speedup
Journal
Volume
Issue
ISSN
13
3s
1539-9087
Citations 
PageRank 
References 
2
0.37
19
Authors
3
Name
Order
Citations
PageRank
Ahmad Lashgar1165.45
Ahmad Khonsari221042.43
Amirali Baniasadi322133.12