Title
High availability in cheap distributed key value storage
Abstract
Memory-based storage currently offers the highest-performance distributed storage, keeping the primary copy of all data in DRAM. Recent advances in non-volatile main memory (NVMM) technologies promise latency similar to DRAM at reduced cost and energy, but will make providing high availability more challenging. Previous approaches to failure recovery involve maintaining multiple identical replicas or relying on fast offline restoration of data from backup replicas stored on SSD. Unfortunately, NVMM's combination of lower write throughput and increased storage density means that offline restoration can no longer provide sufficiently fast recovery, and maintaining multiple identical replicas is generally cost prohibitive. CANDStore is a strongly consistent, distributed, replicated key-value store that uses a new fast crash recovery protocol. As a result, CANDStore can use NVMM and NVMe SSD technology to provide low-latency distributed storage that is cheaper and higher-availability than existing main memory-based distributed storage. Our evaluation shows that CANDStore's recovery protocol enables the system to restore performance and meet SLOs after the failure of a primary node 4.5--10.5x faster than offline recovery.
Year
DOI
Venue
2020
10.1145/3419111.3421290
SoCC '20: ACM Symposium on Cloud Computing Virtual Event USA October, 2020
DocType
ISBN
Citations 
Conference
978-1-4503-8137-6
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Thomas Kim100.34
Daniel Lin-Kit Wong200.34
Gregory R. Ganger34560383.16
Michael Kaminsky43390229.93
andersen david g54823345.31