Title
Accelerated high-performance computing through efficient multi-process GPU resource sharing
Abstract
The HPC field is witnessing widespread adoption of GPUs as accelerators for traditional homogeneous HPC systems. One of the prevalent parallel programming models is the SPMD paradigm, which has been adapted for GPU-based parallel processing. Since each process executes the same program under SPMD, every process mapped to a CPU core also needs access to a GPU. SPMD therefore demands a symmetric CPU/GPU distribution. However, since modern HPC systems feature far more CPU cores than GPUs, computing resources are generally underutilized with SPMD. Our previous efforts have focused on GPU virtualization that enables efficient sharing of a GPU among multiple CPU processes. Nevertheless, a formal method to evaluate and choose the appropriate GPU sharing approach is still lacking. In this paper, based on SPMD GPU kernel profiles, we propose different multi-process GPU sharing scenarios under virtualization. We introduce an analytical model that captures these sharing scenarios and provides a theoretical estimate of the performance gain. Benchmarks validate our analyses and the achievable performance gains. While our analytical study provides a suitable theoretical foundation for GPU sharing, the experimental results demonstrate that GPU virtualization affords significant performance improvements over the non-virtualized solutions for all proposed sharing scenarios.
Year
2012
DOI
10.1145/2212908.2212950
Venue
Conf. Computing Frontiers
Keywords
efficient multi-process gpu resource,accelerated high-performance computing,gpu distribution,gpu sharing,gpu availability,spmd gpu kernel profile,efficient sharing,gpu virtualization,sharing scenario,cpu core,proposed sharing scenario,appropriate gpu sharing approach,parallel programming model,parallel processing,resource sharing,spmd,virtualization,formal method,hpc
Field
Kernel (linear algebra),Virtualization,SPMD,Supercomputer,Homogeneous,Computer science,Parallel computing,Formal methods,Shared resource,Multi-core processor,Distributed computing
DocType
Conference
Citations
1
PageRank
0.34
References
3
Authors
3
Name | Order | Citations | PageRank
Teng Li | 1 | 53 | 5.40
Vikram K. Narayana | 2 | 102 | 13.18
Tarek El-Ghazawi | 3 | 427 | 44.88