Title
HAFT: hardware-assisted fault tolerance.
Abstract
Transient hardware faults during the execution of a program can cause data corruptions. We present HAFT, a fault tolerance technique using hardware extensions of commodity CPUs to protect unmodified multithreaded applications against such corruptions. HAFT utilizes instruction-level redundancy for fault detection and hardware transactional memory for fault recovery. We evaluated HAFT with Phoenix and PARSEC benchmarks. The observed normalized runtime is 2x, with 98.9% of the injected data corruptions being detected and 91.2% being corrected. To demonstrate the effectiveness of HAFT, we applied it to real-world case studies including Memcached, Apache, and SQLite.
Year
DOI
Venue
2016
10.1145/2901318.2901339
EuroSys
Field
DocType
Citations 
Parsec,Computer science,Fault detection and isolation,Covert channel,Real-time computing,Transactional memory,Redundancy (engineering),Fault tolerance,Computer hardware,Operating system,Embedded system
Conference
13
PageRank 
References 
Authors
0.71
54
5
Name
Order
Citations
PageRank
Dmitrii Kuvaiskii1586.42
Rasha Faqeh2161.79
Pramod Bhatotia341428.94
Pascal Felber42432178.76
Christof Fetzer52429172.89