Title
Ground-Truth Prediction to Accelerate Soft-Error Impact Analysis for Iterative Methods
Abstract
Understanding the impact of soft errors on applications can be expensive. Often, it requires an extensive error injection campaign involving numerous runs of the full application in the presence of errors. In this paper, we present a novel approach to arriving at the ground truth-the true impact of an error on the final output-for iterative methods by observing a small number of iterations to learn deviations between normal and error-impacted execution. We develop a machine learning based predictor for three iterative methods to generate ground-truth results without running them to completion for every error injected. We demonstrate that this approach achieves greater accuracy than alternative prediction strategies, including three existing soft error detection strategies. We demonstrate the effectiveness of the ground truth prediction model in evaluating vulnerability and the effectiveness of soft error detection strategies in the context of iterative methods.
Year
DOI
Venue
2019
10.1109/HiPC.2019.00048
2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC)
Keywords
Field
DocType
Iterative Solvers,Failure Prediction,Machine Leraning,Soft Errors,Detectors
Small number,Soft error,Computer science,Iterative method,Parallel computing,Algorithm,Ground truth,Soft error detection,Detector
Conference
ISSN
ISBN
Citations 
1094-7256
978-1-7281-4536-5
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Burcu Ozcelik Mutlu141.88
Gokcen Kestor214814.25
Adrian Cristal350032.64
Osman Unsal416414.33
Sriram Krishnamoorthy5120286.68