Abstract | ||
---|---|---|
In dynamic grid environment failures (e.g. link down, resource failures) are frequent. We present a fault tolerance scheme for hierarchical dynamic scheduler (HDS) for grid workflow applications [1]. In HDS all resources are arranged in a hierarchy tree and each resource acts as a scheduler. The fault tolerance scheme is fully distributed and is responsible for maintaining the hierarchy tree in the presence of failures. Our fault tolerance scheme handles root failures specially, which avoids root becoming single point of failure. The resources detecting failures are responsible for taking appropriate actions. Our fault tolerance scheme uses randomization to get rid of multiple simultaneous failures. Our simulation results show that the recovery process is fast and the failures affect minimally to the scheduling process. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1109/ICPP-W.2008.7 | ICPP Workshops |
Keywords | Field | DocType |
resource act,hierarchical dynamic schedulers,dynamic grid environment failure,recovery process,root failure,resource failure,grid workflow application,hierarchical dynamic scheduler,fault tolerance scheme,hierarchy tree,scheduling process,scheduling,software fault tolerance,dynamic scheduling,pediatrics,fault tolerant,fault tolerance,grid,grid computing | Single point of failure,Grid computing,Scheduling (computing),Computer science,Parallel computing,Software fault tolerance,Fault tolerance,Hierarchy,Dynamic priority scheduling,Grid,Distributed computing | Conference |
Citations | PageRank | References |
7 | 0.58 | 9 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nitin B. Gorde | 1 | 7 | 0.58 |
Sanjeev K. Aggarwal | 2 | 41 | 4.60 |