Title
GPUvm: why not virtualizing GPUs at the hypervisor?
Abstract
Graphics processing units (GPUs) provide orders-of-magnitude speedup for compute-intensive data-parallel applications. However, enterprise and cloud computing domains, where resource isolation of multiple clients is required, have poor access to GPU technology. This is due to lack of operating system (OS) support for virtualizing GPUs in a reliable manner. To make GPUs more mature system citizens, we present an open architecture of GPU virtualization with a particular emphasis on the Xen hypervisor. We provide design and implementation of full- and para-virtualization, including optimization techniques to reduce overhead of GPU virtualization. Our detailed experiments using a relevant commodity GPU show that the optimized performance of GPU para-virtualization is yet two or three times slower than that of pass-through and native approaches, whereas full-virtualization exhibits a different scale of overhead due to increased memory-mapped I/O operations. We also demonstrate that coarse-grained fairness on GPU resources among multiple virtual machines can be achieved by GPU scheduling; finer-grained fairness needs further architectural support by the nature of non-preemptive GPU workload.
Year
2014
Venue
USENIX Annual Technical Conference
Field
Virtualization, Virtual machine, Open architecture, Scheduling (computing), CUDA, Computer science, Parallel computing, Hypervisor, Real-time computing, Operating system, Speedup, Cloud computing
DocType
Conference
Citations
36
PageRank
1.02
References
32
Authors
4
Name            Order  Citations  PageRank
Yusuke Suzuki   1      47         3.96
Shinpei Kato    2      951        62.18
Hiroshi Yamada  3      169        25.23
Kenji Kono      4      148        8.43