Title | ||
---|---|---|
RADICAL-Pilot: Scalable Execution of Heterogeneous and Dynamic Workloads on Supercomputers |
Abstract | ||
---|---|---|
Traditionally, high-performance computing (HPC) systems have been optimized to support mostly monolithic workloads. The workload of many important scientific applications, however, is comprised of spatially and temporally heterogeneous tasks that are often dynamically inter-related. These workloads can benefit from being executed at scale on HPC resources, but a tension exists between their resource utilization requirements and the capabilities of HPC system software and HPC usage policies. Pilot systems have successfully been used to address this tension. In this paper, we introduce RADICAL-Pilot (RP), a scalable and interoperable pilot system that faithfully implements the Pilot abstraction. We describe its design and characterize the performance of its components, as well as its performance on multiple heterogeneous HPC systems. Specifically, we characterize RP's task execution component (the RP Agent), which is engineered for optimal resource utilization while maintaining the full generality of the Pilot abstraction. |
Year | Venue | DocType |
---|---|---|
2015 | arXiv: Distributed, Parallel, and Cluster Computing | Journal |
Volume | Citations | PageRank |
---|---|---|
abs/1512.08194 | 6 | 0.54 |
References | Authors |
---|---|
2 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Andre Merzky | 1 | 130 | 20.45 |
Mark Santcroos | 2 | 70 | 8.11 |
Matteo Turilli | 3 | 84 | 16.21 |
Shantenu Jha | 4 | 188 | 32.40 |