Title
NEMESIS: A software approach for computing in presence of soft errors.
Abstract
Soft errors are considered as the main reliability challenge for sub-nanoscale microprocessors. Software-level soft error resilience schemes are desirable because they require no hardware modifications and their protection can be tuned based on the application requirements. However, existing software-level error tolerant schemes do not provide high-level of protection. In this work, we present NEMESIS - a compiler-level fine-grain soft error detection, diagnosis and recovery technique that can provide high degree of error-resiliency. NEMESIS runs three versions of computations and detects soft errors by checking the results of all memory write and branch operations. In the case of mismatch, NEMESIS recovery routine reverts the effect of error from the architectural state of the program and program resumes its normal execution. Our extensive μ-architectural-level fault injection experiments results show that NEMESIS transformation is able to detect all soft errors and recover from 97% of detected errors.
Year
DOI
Venue
2017
10.1109/ICCAD.2017.8203792
ICCAD
Keywords
Field
DocType
Soft Errors,Reliability,Compiler Optimization,Silent Data Corruption
Silent data corruption,Soft error,Computer science,Optimizing compiler,Real-time computing,Software,Soft error detection,Computer engineering,Fault injection,Computation
Conference
ISSN
ISBN
Citations 
1933-7760
978-1-4503-5950-4
2
PageRank 
References 
Authors
0.35
19
3
Name
Order
Citations
PageRank
Moslem Didehban1163.33
Aviral Shrivastava281268.67
Sai Ram Dheeraj Lokam330.70