Title
Dynamic Fault-Tolerant Workflow Scheduling With Hybrid Spatial-Temporal Re-Execution In Clouds
Abstract
Improving reliability is one of the major concerns of scientific workflow scheduling in clouds. The ever-growing computational complexity and data size of workflows present challenges to fault-tolerant workflow scheduling. Therefore, it is essential to design a cost-effective fault-tolerant scheduling approach for large-scale workflows. In this paper, we propose a dynamic fault-tolerant workflow scheduling (DFTWS) approach with hybrid spatial and temporal re-execution schemes. First, DFTWS calculates the time attributes of tasks and identifies the critical path of workflow in advance. Then, DFTWS assigns appropriate virtual machine (VM) for each task according to the task urgency and budget quota in the phase of initial resource allocation. Finally, DFTWS performs online scheduling, which makes real-time fault-tolerant decisions based on failure type and task criticality throughout workflow execution. The proposed algorithm is evaluated on real-world workflows. Furthermore, the factors that affect the performance of DFTWS are analyzed. The experimental results demonstrate that DFTWS achieves a trade-off between high reliability and low cost objectives in cloud computing environments.
Year
DOI
Venue
2019
10.3390/info10050169
INFORMATION
Keywords
Field
DocType
cloud computing, workflow scheduling, fault-tolerant, re-execution
Data mining,Virtual machine,Scheduling (computing),Computer science,Fault tolerance,Resource allocation,Critical path method,Workflow,Cloud computing,Distributed computing,Computational complexity theory
Journal
Volume
Issue
Citations 
10
5
1
PageRank 
References 
Authors
0.35
0
3
Name
Order
Citations
PageRank
Na Wu163.51
De-Cheng Zuo28618.87
Zhan Zhang31910.81