Title
Intelligent colocation of HPC workloads
Abstract
Many HPC applications suffer from a bottleneck in the shared caches, instruction execution units, I/O or memory bandwidth, even though the remaining resources may be underutilized. It is hard for developers and runtime systems to ensure that all critical resources are fully exploited by a single application, so an attractive technique for increasing HPC system utilization is to colocate multiple applications on the same server. When applications share critical resources, however, contention on shared resources may lead to reduced application performance.
Year
DOI
Venue
2021
10.1016/j.jpdc.2021.02.010
Journal of Parallel and Distributed Computing
Keywords
DocType
Volume
Resource manager,HPC systems,Machine learning,Colocation,Performance Characterization,Performance counters
Journal
151
ISSN
Citations 
PageRank 
0743-7315
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Felippe V. Zacarias100.34
Vinicius Petrucci221213.68
Rajiv Nishtala3253.79
Paul M. Carpenter410215.48
Daniel Mossé52184148.86