Title
Data-Driven Job Dispatching in HPC Systems
Abstract
As High Performance Computing (HPC) systems get closer to exascale performance, job dispatching strategies become critical for keeping system utilization high while keeping waiting times low for jobs competing for HPC system resources. In this paper, we take a data-driven approach and investigate whether better dispatching decisions can be made by transforming the log data produced by an HPC system into useful knowledge about its workload. In particular, we focus on job duration, develop a data-driven approach to job duration prediction, and analyze the effect of different prediction approaches in making dispatching decisions using a real workload dataset collected from Eurora, a hybrid HPC system. Experiments on various dispatching methods show promising results.
Year
2017
DOI
10.1007/978-3-319-72926-8_37
Venue
Lecture Notes in Computer Science
Field
Data-driven, Supercomputer, Workload, Computer science, Distributed computing
DocType
Conference
Volume
10710
ISSN
0302-9743
Citations
3
PageRank
0.39
References
18
Authors
6
Name                   Order  Citations  PageRank
Cristian Galleguillos  1      30         5.96
Alina Sîrbu            2      67         9.06
Zeynep Kiziltan        3      374        27.79
Ozalp Babaoglu         4      1867       135.64
Andrea Borghesi        5      17         2.06
Thomas Bridi           6      10         1.17