Title
Failover strategy for fault tolerance in cloud computing environment.
Abstract
Cloud fault tolerance is an important issue in cloud computing platforms and applications. In the event of an unexpected system failure or malfunction, a robust fault-tolerant design may allow the cloud to continue functioning correctly, possibly at a reduced level, instead of failing completely. To ensure high availability of critical cloud services, application execution, and hardware performance, various fault-tolerant techniques exist for building autonomous cloud systems. In comparison with current approaches, this paper proposes a more robust and reliable architecture that uses an optimal checkpointing strategy to ensure high system availability and a reduced task service finish time. Using pass rates and virtualized mechanisms, the proposed smart failover strategy (SFS) scheme uses components such as a cloud fault manager, a cloud controller, a cloud load balancer, and a selection mechanism, providing fault tolerance via redundancy, optimized selection, and checkpointing. In our approach, the cloud fault manager repairs faults generated before the task time deadline is reached, blocking unrecoverable faulty nodes as well as their virtual nodes. The scheme can also remove temporary software faults from recoverable faulty nodes, thereby making them available for future requests. We argue that the proposed SFS algorithm makes the system highly fault tolerant by considering forward and backward recovery using diverse software tools. Compared with existing approaches, preliminary experiments with the SFS algorithm indicate an increase in pass rates and a consequent decrease in failure rates, showing good overall performance in task allocation. We present these results using experimental validation tools and compare them with other techniques, laying a foundation for a fully fault-tolerant infrastructure as a service cloud environment. Copyright (C) 2017 John Wiley & Sons, Ltd.
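The abstract names the ingredients of SFS (pass-rate-based selection, checkpointing, and blocking of unrecoverable nodes) but does not give the algorithm itself. The following is only a minimal sketch of how those three ideas could fit together, not the authors' implementation. All names (`Node`, `select_node`, `run_task`, `block_after`) and the fault-injection model (each injected fault fires once, standing in for a temporary software fault) are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical compute node tracked by a fault manager."""
    name: str
    passed: int = 0      # tasks completed on this node
    failed: int = 0      # faults observed on this node
    blocked: bool = False

    @property
    def pass_rate(self) -> float:
        total = self.passed + self.failed
        # Assumption: a node with no history is treated as fully reliable.
        return self.passed / total if total else 1.0


def select_node(nodes):
    """Pick the unblocked node with the highest pass rate."""
    candidates = [n for n in nodes if not n.blocked]
    if not candidates:
        raise RuntimeError("all nodes are blocked")
    return max(candidates, key=lambda n: n.pass_rate)


def run_task(nodes, steps, faults, block_after=2):
    """Run `steps` units of work, resuming from the last checkpoint on failover.

    faults: set of (node_name, step) pairs; each fires once (transient fault).
    block_after: assumed threshold of faults after which a node is blocked.
    Returns the list of (node_name, step) pairs actually executed.
    """
    faults = set(faults)   # copy so the caller's set is not mutated
    checkpoint = 0         # index of the next step that still has to run
    log = []
    while checkpoint < steps:
        node = select_node(nodes)
        failed = False
        for step in range(checkpoint, steps):
            if (node.name, step) in faults:
                faults.discard((node.name, step))
                node.failed += 1
                node.blocked = node.failed >= block_after
                failed = True
                break      # fail over; work before `checkpoint` is kept
            log.append((node.name, step))
            checkpoint = step + 1   # checkpoint after every completed step
        if not failed:
            node.passed += 1
    return log
```

For example, with two nodes `a` and `b`, five steps, and a single fault injected at `("a", 2)`, the sketch runs steps 0 and 1 on `a`, then resumes on `b` from step 2 rather than restarting the task. Checkpointing after every step is the simplest possible policy; the paper's "optimal checkpointing strategy" is not specified in the abstract and is not modeled here.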
Year
2017
DOI
10.1002/spe.2491
Venue
SOFTWARE-PRACTICE & EXPERIENCE
Keywords
cloud computing, fault tolerance, checkpointing, virtualization, load balancing, virtual machine
Field
Virtualization, Failover, Virtual machine, Load balancing (computing), Computer science, Software fault tolerance, Fault tolerance, Operating system, Cloud testing, Distributed computing, Cloud computing
DocType
Journal
Volume
47
Issue
SP9.0
ISSN
0038-0644
Citations
8
PageRank
0.48
References
14
Authors
5
Name | Order | Citations | PageRank
Bashir Mohammed | 1 | 22 | 3.90
Mariam Kiran | 2 | 121 | 17.83
Kabiru M. Maiyama | 3 | 8 | 0.48
Mumtaz M. Kamala | 4 | 8 | 0.48
Irfan Ullah Awan | 5 | 711 | 87.15