Title
QoS-aware dynamic resource allocation for spatial-multitasking GPUs.
Abstract
General-purpose computing on GPUs (GPGPU computing) is becoming widely adopted; however, some GPGPU applications fail to fully utilize GPU resources. In these cases, spatial multitasking better exploits the parallelism offered by GPUs by partitioning the GPU resources among simultaneously-running applications. When one or more such applications have quality-of-service (QoS) requirements, enough resources must be allocated for those applications to satisfy their requirements. Remaining resources can be either disabled to reduce power consumption or used to accelerate other applications. However, we observe that the amount of resources for a QoS application to satisfy its performance requirement is dependent in part upon the co-executing applications. In this paper, we propose a runtime technique to dynamically partition GPU resources between concurrently running applications-at least one of which has a QoS requirement. We demonstrate that the proposed technique can satisfy a 100% QoS requirement while also achieving either a 7W power consumption reduction or a 17.57% performance improvement for co-executing best-effort applications.
Year
DOI
Venue
2014
10.1109/ASPDAC.2014.6742976
ASP-DAC
Keywords
Field
DocType
graphics processing units,quality of service,resource allocation,GPGPU computing,QoS-aware dynamic resource allocation,dynamically partition resources,general-purpose computing,power 7 W,quality-of-service requirements,runtime technique,simultaneously-running applications,spatial-multitasking GPU
Qos aware,Computer science,Quality of service,Resource allocation (computer),Exploit,Real-time computing,Resource allocation,General-purpose computing on graphics processing units,Human multitasking,Performance improvement
Conference
ISSN
Citations 
PageRank 
2153-6961
8
0.43
References 
Authors
6
3
Name
Order
Citations
PageRank
Paula Aguilera1423.67
Katherine Morrow21445.33
Nam Sung Kim33268225.99