Title
Using failure injection mechanisms to experiment and evaluate a grid failure detector
Abstract
Computing grids are large-scale, highly distributed, often hierarchical platforms. At such scales, failures are no longer exceptions but part of normal behavior. When designing software for grids, developers have to take failures into account. It is crucial to run experiments at a large scale, under various volatility conditions, in order to measure the impact of failures on the whole system. This paper presents an experimental tool that allows the user to inject failures during a practical evaluation of fault-tolerant systems. We illustrate the usefulness of our tool through an evaluation of a hierarchical grid failure detector.
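The abstract does not describe how the tool or the detector works internally, so the following is only a minimal sketch of the general idea: crash failures are injected at random times, and we measure how long a simple timeout-based heartbeat detector takes to suspect each crashed node. Everything here is an assumption made for illustration (the function name `simulate`, the exponential failure law, the parameters `mtbf`, `heartbeat_period` and `timeout`, and the flat rather than hierarchical detector); none of it is taken from the paper.

```python
import random


def simulate(num_nodes=20, mtbf=50.0, heartbeat_period=1.0, timeout=3.0,
             duration=200.0, seed=0):
    """Inject crash failures and return the detection delay for each crash.

    Hypothetical model, not the paper's tool: each node crashes at a time
    drawn from an exponential law with mean `mtbf`, sends heartbeats at
    0, p, 2p, ... until it crashes, and is suspected once `timeout` has
    elapsed since its last heartbeat. Network latency is ignored.
    """
    rng = random.Random(seed)  # seeded so that a run can be reproduced
    delays = []
    for _ in range(num_nodes):
        crash_time = rng.expovariate(1.0 / mtbf)  # injected failure time
        if crash_time >= duration:
            continue  # this node survives the whole experiment
        # Last heartbeat emitted at or before the crash.
        last_heartbeat = (crash_time // heartbeat_period) * heartbeat_period
        suspect_time = last_heartbeat + timeout
        delays.append(suspect_time - crash_time)
    return delays


if __name__ == "__main__":
    delays = simulate()
    print(f"{len(delays)} injected crashes detected")
    print(f"mean detection delay: {sum(delays) / len(delays):.2f}")
```

In this simplified model the detection delay always lies between `timeout - heartbeat_period` and `timeout`. Changing `mtbf` is one way to emulate the different volatility conditions the abstract mentions.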
Year
2006
DOI
10.1007/978-3-540-71351-7_48
Venue
VECPAR
Keywords
normal behavior, various volatility condition, experimental tool, computing grid, hierarchical grid failure detector, longer exception, practical evaluation, whole system, fault-tolerant system, large scale, failure injection mechanism, failure detector, fault tolerant, grid computing, fault tolerance, fault tolerant system
Field
Failure detector, Grid computing, Computer science, Software, Fault tolerance, Volatility (finance), Grid, Distributed computing
DocType
Conference
Volume
4395
ISSN
0302-9743
Citations
1
PageRank
0.35
References
15
Authors
2
Name              Order  Citations  PageRank
Sebastien Monnet  1      102        10.92
Marin Bertier     2      382        24.31