Title
Enabling efficient preemption for SIMT architectures with lightweight context switching.
Abstract
Context switching is a key technique enabling preemption and time-multiplexing for CPUs. However, for single-instruction multiple-thread (SIMT) processors such as high-end graphics processing units (GPUs), it is challenging to support context switching due to the massive number of threads, which leads to a huge amount of architectural states to be swapped during context switching. The architectural state of SIMT processors includes registers, shared memory, SIMT stacks and barrier states. Recent works present thread-block-level preemption on SIMT processors to avoid context switching overhead. However, because the execution time of a thread block (TB) is highly dependent on the kernel program. The response time of preemption cannot be guaranteed and some TB-level preemption techniques cannot be applied to all kernel functions. In this paper, we propose three complementary ways to reduce and compress the architectural states to achieve lightweight context switching on SIMT processors. Experiments show that our approaches can reduce the register context size by 91.5% on average. Based on lightweight context switching, we enable instruction-level preemption on SIMT processors with compiler and hardware co-design. With our proposed schemes, the preemption latency is reduced by 59.7% on average compared to the naive approach.
Year
DOI
Venue
2016
10.1109/SC.2016.76
SC
Keywords
Field
DocType
SIMT architecture efficient preemption,lightweight context switching,CPU preemption,CPU time-multiplexing,single-instruction multiple-thread processors,high-end graphics processing units,GPU,SIMT processor architectural states,registers,shared memory,SIMT stacks,barrier states,thread-block-level preemption,TB level preemption,kernel program,preemption response time,instruction-level preemption,compiler,hardware codesign
Kernel (linear algebra),Preemption,Shared memory,Instruction set,Computer science,Parallel computing,Response time,Compiler,Thread (computing),Distributed computing,Context switch
Conference
ISBN
Citations 
PageRank 
978-1-4673-8815-3
16
0.58
References 
Authors
25
3
Name
Order
Citations
PageRank
Zhen Lin1354.21
Lars S. Nyland2596.26
Huiyang Zhou399463.26