Title
The Median Resource Failure Checkpointing
Abstract
In grid computing, achieving an enviable level of fault tolerance depends on proper resource utilization and job scheduling. The literature offers two solutions to these two challenging tasks, viz., checkpointing and replication. This paper proposes a checkpointing strategy that uses the median of the resources' failure intervals to decide the checkpoint intervals for the given jobs. The strategy improves system throughput, reduces job losses and job execution times, and eliminates unnecessary checkpoints.
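The abstract does not say how the median is mapped to a checkpoint interval, so the following is only a minimal sketch under stated assumptions: the checkpoint interval is taken to be the median itself, and a job shorter than that interval gets no checkpoints. The function names `failure_intervals` and `checkpoint_times` are hypothetical and do not come from the paper.

```python
from statistics import median


def failure_intervals(failure_times):
    """Return the gaps between consecutive failure timestamps of one resource."""
    ordered = sorted(failure_times)
    return [later - earlier for earlier, later in zip(ordered, ordered[1:])]


def checkpoint_times(job_length, failure_times):
    """Assumed policy (not the paper's): checkpoint once per median failure interval."""
    gaps = failure_intervals(failure_times)
    if not gaps:
        return []  # no failure history, so this sketch schedules no checkpoints
    interval = median(gaps)
    if interval <= 0 or job_length <= interval:
        return []  # job is expected to finish before a typical failure
    times = []
    t = interval
    while t < job_length:
        times.append(t)
        t += interval
    return times


# Example: failures observed at these times on one resource, job of length 50.
schedule = checkpoint_times(50, [0, 10, 30, 40, 100])
```

Under these assumptions, a short job on a resource that rarely fails is never checkpointed, which is one way the "unnecessary checkpoints" mentioned in the abstract could be avoided.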
Year: 2012
DOI: 10.7148/2012-0483-0489
Venue: PROCEEDINGS 26TH EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2012
Keywords: Fault tolerance, Checkpointing, Distributed systems
Field: Job losses, Grid computing, Computer science, Scheduling (computing), Fault tolerance, Throughput, Reliability engineering
DocType: Conference
Citations: 2
PageRank: 0.35
References: 12
Authors: 5
Name                  Order  Citations  PageRank
Suleman Khan          1      55         4.02
Khizar Hayat          2      248        19.71
Sajjad Ahmad Madani   3      409        26.21
Samee Ullah Khan      4      1605       81.01
Joanna Kolodziej      5      920        55.57