Title
Modeling and predicting execution time of scientific workflows in the Grid using radial basis function neural network.
Abstract
With the maturity of electronic science (e-science) the scientific applications are growing to be more complex composed of a set of coordinating tasks with complex dependencies among them referred to as workflows. For optimized execution of workflows in the Grid, the high level middleware services (like task scheduler, resource broker, performance steering service etc.) need in-advance estimates of workflow execution times. However, modeling and predicting workflow execution time in the Grid is complex due to several tasks in a workflow, their distributed execution on multiple heterogeneous Grid-sites, and dynamic behaviour of the shared Grid resources. In this paper, we describe a novel method based on radial basis function neural network to model and predict workflow execution time in the Grid. We model workflows execution time in terms of attributes describing workflow structure and execution runtime information. To further refine our models, we employ principle component analysis to eliminate attributes of lesser importance. We recommend a set of only 14 attributes (as compared with total 21) to effectively model workflow execution time. Our reduced set of attributes improves the prediction accuracy by \(16\%\). Results of our prediction experiments for three real-world scientific workflows are presented to show that our predictions are more accurate than the two best methods from related work so far.
Year
DOI
Venue
2017
10.1007/s10586-017-1018-x
Cluster Computing
Keywords
Field
DocType
Scientific workflow applications, Distributed execution of scientific workflows, Workflow execution time prediction in the Grid
Middleware,Data mining,Workflow technology,Computer science,Windows Workflow Foundation,Real-time computing,Workflow engine,Workflow management system,Workflow,Principal component analysis,Grid,Distributed computing
Journal
Volume
Issue
ISSN
20
3
1386-7857
Citations 
PageRank 
References 
3
0.41
45
Authors
6
Name
Order
Citations
PageRank
Farrukh Nadeem11168.06
Daniyal M. Alghazzawi27819.62
Abdulfattah S. Mashat3276.85
Khalid Fakeeh430.41
Abdullah Almalaise530.41
Hani Hagras61747129.26