Fine-Grained Management of Thread Blocks for Irregular Applications - Citegraph

Paper Info

Title
Fine-Grained Management of Thread Blocks for Irregular Applications

Abstract
Co-locating threads with complementary resource usage is a key strategy for improving throughput in parallel machines such as GPUs. However, the proliferation of irregular algorithms which change execution behavior dynamically makes optimal thread placement impossible when done statically or by profiling kernels as a whole. In this work, we characterize the performance loss associated with current thread block scheduling policies in GPU architectures. We then demonstrate that an extension of these strategies incorporating dynamic performance metrics such as memory and functional unit utilization at the thread block level as well as preemptive thread block swapping can improve throughput. We show that performance on irregular algorithms can be improved by an average increase of 17.1% over static profiling methods and 12.9% over dynamic strategies with no changes to legacy software and minimal hardware extensions which increase SRAM storage area by less than 0.5%.

Year	DOI	Venue
2019	10.1109/ICCD46524.2019.00042	2019 IEEE 37th International Conference on Computer Design (ICCD)
Keywords	Field	DocType
GPUs, thread scheduling, resource management, preemption, irregular parallelism	Resource management,Preemption,Profiling (computer programming),Computer science,Parallel computing,Thread (computing),Static random-access memory,Throughput,Block scheduling,Legacy system	Conference
ISSN	ISBN	Citations
1063-6404	978-1-7281-1215-2	0
PageRank	References	Authors
0.34	15	2

Authors (2 rows)

Cited by (0 rows)

References (15 rows)

Name	Order	Citations	PageRank
Jonathan Beaumont	1	36	2.85
Trevor Mudge	2	6139	659.74

1