Title
Runtime prediction of parallel applications with workload-aware clustering.
Abstract
Traditionally, many science fields require great support for a massive workflow, which utilizes multiple cores simultaneously. In order to support such large-scale scientific workflows, high-capacity parallel systems such as supercomputers are widely used. To increase the utilization of these systems, most schedulers use backfilling policy based on user’s estimated runtime. However, it is found to be extremely inaccurate because users overestimate their jobs. Therefore, in this paper, an efficient machine learning approach is present to predict the runtime of parallel application. The proposed method is divided into three phases. First is to analyze important feature of the history log data by factor analysis. Second is to carry out clustering for the parallel program based on the important features. Third is to build a prediction models by pattern similarity of parallel program log data and estimate runtime. In the experiments, we use workload logs on parallel systems (i.e., NASA-iPSC, LANL-CM5, SDSC-Par95, SDSC-Par96, and CTC-SP2) to evaluate the effectiveness of our approach. Comparing root-mean-square error with other techniques, experimental results show that the proposed method improves the accuracy up to 69.56%.
Year
DOI
Venue
2017
https://doi.org/10.1007/s11227-017-2038-2
The Journal of Supercomputing
Keywords
Field
DocType
Runtime prediction,Workload-aware clustering,Support vector regression,Machine learning approach
Data mining,Workload,Runtime prediction,Computer science,Support vector machine,Parallel computing,Runtime verification,Predictive modelling,Cluster analysis,Workflow,Distributed computing
Journal
Volume
Issue
ISSN
73
11
0920-8542
Citations 
PageRank 
References 
0
0.34
15
Authors
2
Name
Order
Citations
PageRank
Ju-Won Park1195.09
Eun-Hye Kim21910.40