Abstract | ||
---|---|---|
High availability (HA) is critical for many stream processing applications such as financial data analysis and disaster response. Existing HA schemes use either active standby or passive standby to guard the system against unexpected failures such as machine crash. Despite previous efforts of simulation-based studies that report active standby is superior, there is a lack of in-depth understanding of the tradeoff between different HA approaches under practical settings. In this paper, we propose a novel sweeping checkpointing method that can reduce the overhead by one order of magnitude. Whereas most previous work addresses single failures, we prove that the sweeping checkpointing method ensures no loss of data even against multiple concurrent failures. We then implement and compare the resulting passive standby variant against active standby using a real stream processing system. We find that passive standby presents a different tradeoff from active standby: longer recovery time, but 90% less overhead. Thus each approach has its suitable scenarios. |
Year | DOI | Venue |
---|---|---|
2009 | 10.5555/1656980.1657012 | Middleware (Companion) |
Keywords | Field | DocType |
financial data analysis,different tradeoff,previous work,empirical study,previous effort,stream processing system,active standby,ha scheme,checkpointing method,different ha approach,passive standby variant,passive standby,high availability,stream processing | Crash,Computer science,Guard (information security),Stream processing,High availability,Empirical research,Distributed computing | Conference |
ISBN | Citations | PageRank |
978-3-642-25820-6 | 24 | 1.06 |
References | Authors | |
16 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yu Gu 0004 | 1 | 2158 | 127.59 |
Zhe Zhang | 2 | 24 | 1.06 |
Fan Ye | 3 | 2843 | 181.85 |
Hao Yang | 4 | 660 | 48.26 |
Minkyong Kim | 5 | 806 | 60.10 |
Hui Lei | 6 | 1426 | 98.86 |
Zhen Liu | 7 | 1088 | 102.40 |