Title
Algorithm Based Fault Tolerance versus Result-Checking for Matrix Computations
Abstract
Algorithm Based Fault Tolerance (ABFT) is the collective name of a set of techniques used to determine the correctness of some mathematical calculations. A less well known alternative is called Result Checking (RC) where, contrary to ABFT, results are checked without knowledge of the particular algorithm used to calculate them.In this paper a comparison is made between the two using some practical implementations of matrix computations. The criteria are performance and memory overhead, ease of use and error coverage. For the latter extensive error injection experiments were made. To the best of our knowledge, this is the first time that RC is validated by fault injection.We conclude that Result Checking has the important advantage of being independent of the underlying algorithm. It also has generally less performance overhead than ABFT, the two techniques being essentially equivalent in terms of error coverage.
Year
DOI
Venue
1999
10.1109/FTCS.1999.781028
FTCS
Keywords
Field
DocType
fault tolerance,latter extensive error injection,particular algorithm,fault injection,memory overhead,result checking,collective name,error coverage,matrix computations,performance overhead,underlying algorithm,matrix operations,error correction,error detection,fault detection,ease of use,mathematics,matrix computation,computer architecture
Stuck-at fault,Fault coverage,Computer science,Matrix (mathematics),Correctness,Algorithm,Error detection and correction,Fault tolerance,Matrix multiplication,Fault injection
Conference
ISSN
ISBN
Citations 
0731-3071
0-7695-0213-X
14
PageRank 
References 
Authors
1.20
15
2
Name
Order
Citations
PageRank
Paula Prata1266.58
João Gabriel Silva261863.55