Title
RADICAL-Pilot: Scalable Execution of Heterogeneous and Dynamic Workloads on Supercomputers
Abstract
Traditionally high-performance computing (HPC) systems have been optimized to support mostly monolithic workloads. The workload of many important scientific applications however, is comprised of spatially and temporally heterogeneous tasks that are often dynamically inter-related. These workloads can benefit from being executed at scale on HPC resources but a tension exists between their resource utilization requirements and the capabilities of HPC system software and HPC usage policies. Pilot systems have successfully been used to address this tension. In this paper we introduce RADICAL-Pilot (RP), a scalable and interoperable pilot system that faithfully implements the Pilot abstraction. We describe its design and characterize the performance of its components, as well as its performance on multiple heterogeneous HPC systems. Specifically, we characterize RPu0027s task execution component (the RP Agent), which is engineered for optimal resource utilization while maintaining the full generality of the Pilot abstraction.
Year
Venue
DocType
2015
arXiv: Distributed, Parallel, and Cluster Computing
Journal
Volume
Citations 
PageRank 
abs/1512.08194
6
0.54
References 
Authors
2
4
Name
Order
Citations
PageRank
Andre Merzky113020.45
Mark Santcroos2708.11
Matteo Turilli38416.21
Shantenu Jha418832.40