Abstract | ||
---|---|---|
Transient hardware faults during the execution of a program can cause data corruptions. We present HAFT, a fault tolerance technique using hardware extensions of commodity CPUs to protect unmodified multithreaded applications against such corruptions. HAFT utilizes instruction-level redundancy for fault detection and hardware transactional memory for fault recovery. We evaluated HAFT with Phoenix and PARSEC benchmarks. The observed normalized runtime is 2x, with 98.9% of the injected data corruptions being detected and 91.2% being corrected. To demonstrate the effectiveness of HAFT, we applied it to real-world case studies including Memcached, Apache, and SQLite. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1145/2901318.2901339 | EuroSys |
Field | DocType | Citations |
Parsec,Computer science,Fault detection and isolation,Covert channel,Real-time computing,Transactional memory,Redundancy (engineering),Fault tolerance,Computer hardware,Operating system,Embedded system | Conference | 13 |
PageRank | References | Authors |
0.71 | 54 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dmitrii Kuvaiskii | 1 | 58 | 6.42 |
Rasha Faqeh | 2 | 16 | 1.79 |
Pramod Bhatotia | 3 | 414 | 28.94 |
Pascal Felber | 4 | 2432 | 178.76 |
Christof Fetzer | 5 | 2429 | 172.89 |