Title
Novel Crash Recovery Approach for Concurrent Failures in Cluster Federation
Abstract
In this paper, we have proposed a simple and efficient approach for check pointing and recovery in cluster computing environment. The recovery scheme deals with both orphan and lost intra and inter cluster messages. This check pointing scheme ensures that after the system recovers from failures, all processes in different clusters can restart from their respective recent checkpoints; thus avoiding any domino effect. That is, the recent check points always form a consistent recovery line of the cluster federation. The main features of our work are: it uses selective message logging which enables the initiator process in each cluster to log the minimum number of messages, the recovery scheme is domino effect free and is executed simultaneously by all clusters in the cluster federation, it considers concurrent failures, message complexities in each cluster for both check pointing and recovery schemes are just O (n), where n is the number of processes in a cluster.These features make our algorithm superior to the existing works.
Year
DOI
Venue
2009
10.1007/978-3-642-01671-4_39
GPC
Keywords
Field
DocType
cluster federation,consistent recovery line,recovery scheme,novel crash recovery approach,recent check point,recovery scheme deal,inter cluster message,cluster computing environment,concurrent failures,domino effect,message complexity,different cluster,cluster computing
Domino effect,Cluster (physics),Crash,Message logging,Computer science,Computer network,Recovery approach,Computer cluster,Distributed computing
Conference
Volume
ISSN
Citations 
5529
0302-9743
1
PageRank 
References 
Authors
0.35
17
2
Name
Order
Citations
PageRank
B. Gupta115251.48
Shahram Rahimi217240.74