Title
Skipping Unused Events to Speed Up Rollback-Recovery in Distributed Data-Parallel CEP
Abstract
We propose two extensions for a state-of-the-art method of rollback-recovery in distributed CEP (complex event processing). In CEP, an operator network is used to search for patterns in events streams. Sometimes these operators fail and lose their state. Rollback-recovery is a method for dealing with such state losses. The type of rollback-recovery we consider is upstream backup, where the state of a failed operator is recovered by replaying to it the input events that led it to that state. These events are kept in upstream operators' memory buffers, which are trimmed continuously as the downstream operator progresses. The first extension we propose saves memory and speeds up recovery by avoiding to store and retransmit unnecessary events. The second extension makes the base method of upstream backup compatible with data-parallel CEP, allowing that the windows into which operators partition their input be processed in parallel. We evaluated the proposed extensions through experiments that showed a significant reduction in memory usage and recovery time at the expense of a negligible processing overhead during normal operation.
Year
DOI
Venue
2018
10.1109/BDCAT.2018.00013
2018 IEEE/ACM 5th International Conference on Big Data Computing Applications and Technologies (BDCAT)
Keywords
Field
DocType
complex event processing,fault-tolerance,reliability,recovery
Data mining,Computer science,Complex event processing,Real-time computing,Fault tolerance,Operator (computer programming),Rollback recovery,Backup,Speedup
Conference
ISBN
Citations 
PageRank 
978-1-5386-5503-0
2
0.36
References 
Authors
0
5
Name
Order
Citations
PageRank
Guilherme Augusto Ferreira Lima1133.76
Ahmad Slo291.79
Sukanya Bhowmik3565.99
Markus Endler443255.44
Kurt Rothermel518423.72