Title
Fault-Tolerant Real-time Tasks Scheduling with Dynamic Fault Handling
Abstract
Predictable performance when coping with transient failures is of paramount importance in safety-critical real-time systems. Various software fault-tolerant techniques are employed towards this goal among which check-pointing is a relatively cost-effective scheme. In this paper, we propose an efficient fault-tolerant scheduling framework with run-time fault handling protocol, where criticality levels can be adaptively inserted for fault handling according to run-time fault workload. In contrast to prior works which apply with task re-execution strategy, the proposed framework adaptively determines on-demand re-executions only on the faulty checkpoint segments, rather than on the whole job. Towards this, a unified overrun handling protocol is developed to handle fault recovery adaptively to avoid over-provisioning of resources. In addition, we develop an off-line schedulability analysis technique for the proposed scheduling algorithm. The simulation results show that our fault-tolerant scheduling framework can bring up to 81% improvement in supporting low-criticality service without sacrifice in the MC-schedulability compared with the existing techniques.
Year
DOI
Venue
2020
10.1016/j.sysarc.2019.101688
Journal of Systems Architecture
Keywords
Field
DocType
Fault-tolerant scheduling,Run-time fault handling,Check-pointing,Safety-critical real-time system
Scheduling (computing),Computer science,Workload,Fault handling,Real-time computing,Fault tolerance,Software,Criticality,Distributed computing
Journal
Volume
ISSN
Citations 
102
1383-7621
3
PageRank 
References 
Authors
0.39
0
4
Name
Order
Citations
PageRank
Gang Chen173.92
Nan Guan268549.29
Kai Huang346845.69
Wang Yi44232332.05