Title
ART: adaptive, reliable, and fault-tolerant task management for computational grids
Abstract
The main goal of ART is to reduce the number of replications by applying a checkpointing and rollback scheme to each replication. ART adaptively selects the minimum number of replications for each task, based on an analysis of the probability of successful execution within the given deadline and on the task's reliability requirement. Simulation results show that ART outperforms existing mechanisms.
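The abstract does not state how the minimum number of replications is computed. As a rough illustration only, the sketch below assumes that replicas fail independently and that each one finishes within the deadline with the same probability; the function name `min_replications` and both parameters are hypothetical and are not taken from the paper.

```python
def min_replications(p_success: float, reliability: float) -> int:
    """Smallest n such that 1 - (1 - p_success)**n >= reliability.

    Assumption (not from the paper): replicas are independent and each
    completes within the deadline with probability p_success.
    """
    if not 0.0 < p_success <= 1.0:
        raise ValueError("p_success must be in (0, 1]")
    if not 0.0 <= reliability < 1.0:
        raise ValueError("reliability must be in [0, 1)")

    p_fail = 1.0 - p_success
    n = 1
    # Add replicas until the chance that at least one succeeds
    # meets the reliability requirement.
    while 1.0 - p_fail ** n < reliability:
        n += 1
    return n


if __name__ == "__main__":
    # A higher per-replica success probability (for example, one improved
    # by checkpointing and rollback) lowers the number of replicas needed.
    print(min_replications(0.5, 0.99))
    print(min_replications(0.8, 0.99))
```

Under these assumptions, raising the per-replica success probability from 0.5 to 0.8 cuts the required replicas from 7 to 3 for a reliability requirement of 0.99, which is the kind of saving the abstract attributes to combining checkpointing with replication.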
Year
2010
DOI
10.1145/1774088.1774136
Venue
SAC
Keywords
main goal, rollback scheme, successful execution, simulation result, minimum number, reliability requirement, computational grid, fault-tolerant task management, fault tolerance, grid computing, fault tolerant
Field
Grid computing, Task management, Computer science, Parallel computing, Fault tolerance, Rollback, Distributed computing
DocType
Conference
Citations
1
PageRank
0.36
References
2
Authors
5
Name             Order  Citations  PageRank
Sangho Yi        1      538        35.84
Jung Y. Kim      2      7          2.66
Hong Min         3      57         13.34
Bongjae Kim      4      15         7.10
Chang Oan Sung   5      35         8.76