Title
Fault Tolerance Through Redundant Execution on COTS Multicores: Exploring Trade-Offs
Abstract
High availability and integrity are paramount in systems deployed in life-and mission-critical scenarios. Such fault-tolerance can be achieved through redundant co-execution (RCoE) on replicated hardware, now cheaply available with multicore processors. RCoE replicates almost all software, including OS kernel, drivers, and applications, achieving a sphere of replication that covers everything except the minimal interfaces to non-replicated peripherals. We complement our original, loosely-coupled RCoE with a closely-coupled version that improves transparency of replication to application code, and investigate the functionality, performance and vulnerability trade-offs.
Year
DOI
Venue
2019
10.1109/DSN.2019.00031
2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)
Keywords
DocType
ISSN
seL4,microkernel,SEU,replication,fault tolerance
Conference
1530-0889
ISBN
Citations 
PageRank 
978-1-7281-0058-6
0
0.34
References 
Authors
27
3
Name
Order
Citations
PageRank
Yanyan Shen145749.77
Gernot Heiser22525137.42
K. Elphinstone3119065.76