Title
Dynamic Fault Tolerance Through Resource Pooling
Abstract
Miniaturized satellites are currently not considered suitable for critical, high-priority, and complex multi-phased missions, due to their low reliability. As hardware-side fault tolerance (FT) solutions designed for larger spacecraft cannot be adopted aboard very small satellites due to budget, energy, and size constraints, we developed a hybrid FT-approach based upon only COTS components, commodity processor cores, library IP, and standard software. This approach facilitates fault detection, isolation, and recovery in software, and utilizes fault-coverage techniques across the embedded stack within a multiprocessor system-on-chip (MPSoC). This allows our FPGA-based proof- of-concept implementation to deliver strong fault-coverage even for missions with a long duration, but also to adapt to varying performance requirements during the mission. The operator of a spacecraft utilizing this approach can define performance profiles, which allow an on-board computer (OBC) to trade between processing capacity, fault coverage, and energy consumption using simple heuristics. The software-side FT approach developed also offers advantages if deployed aboard larger spacecraft through spare resource pooling, enabling an OBC to more efficiently handle permanent faults. This FT approach in part mimics a critical biological system's ability to tolerate faults, adapt to permanent failure, and enables graceful aging of a MPSoC.
Year
DOI
Venue
2019
10.1109/AHS.2018.8541457
2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS)
Keywords
DocType
ISSN
multiprocessor system-on-chip,FPGA-based proof- of-concept implementation,strong fault-coverage,performance requirements,embedded stack,utilizes fault-coverage techniques,fault detection,standard software,library IP,commodity processor cores,COTS components,hybrid FT-approach,size constraints,hardware-side fault tolerance solutions,low reliability,complex multiphased missions,high-priority,critical priority,miniaturized satellites,dynamic fault tolerance,MPSoC,critical biological system,permanent faults,spare resource pooling,FT approach,energy consumption,fault coverage,performance profiles,spacecraft,fault isolation,fault-coverage techniques,onboard computer,OBC,processing capacity
Journal
1939-7003
ISBN
Citations 
PageRank 
978-1-5386-7754-4
0
0.34
References 
Authors
20
5
Name
Order
Citations
PageRank
Christian M. Fuchs111.72
Nadia M. Murillo211.05
Aske Plaat352472.18
Erik van der Kouwe4589.55
todor stefanov560245.61