Title
BitVault: a highly reliable distributed data retention platform
Abstract
This paper summarizes our experience designing and implementing BitVault, a content-addressable retention platform for large volumes of reference data -- seldom-changing information that needs to be retained for a long time. BitVault uses "smart bricks" as its building block to lower hardware cost. The challenges are to keep management costs low in a system that scales from one brick to tens of thousands, to ensure reliability, and to deliver a simple design. Our design incorporates peer-to-peer (P2P) technologies for self-management and self-healing, and uses massively parallel repair to reduce the system's vulnerability to data loss. The simplicity of the architecture relies on an eventually reliable membership service provided by a perfect one-hop distributed hash table (DHT). Its object-driven repair model yields a last-replica recall guarantee independent of the failure scenario: as long as the last copy of a data object remains in the system, that data can be retrieved and its replication degree restored. A prototype has been implemented, and theoretical analysis, simulations, and experiments have been conducted to validate the design.
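The ideas named in the abstract (content addressing, one-hop DHT lookup, and object-driven repair) can be illustrated with a small sketch. This is not BitVault's implementation: the SHA-256 hash, the successor-based placement rule, the in-memory `stores` dictionary, and the function names `object_id`, `owners`, `put`, and `repair` are all assumptions chosen for illustration.

```python
import hashlib
from bisect import bisect_left


def _hash(raw: bytes) -> int:
    # SHA-256 is an assumption; the abstract does not name a hash function.
    return int.from_bytes(hashlib.sha256(raw).digest(), "big")


def object_id(data: bytes) -> int:
    """Content address: the key is derived from the object's bytes."""
    return _hash(data)


def owners(key: int, members: list[str], degree: int) -> list[str]:
    """One-hop lookup: with the full membership list known locally,
    choose the `degree` successors of `key` on the hash ring (assumed rule)."""
    ring = sorted(members, key=lambda m: _hash(m.encode()))
    ids = [_hash(m.encode()) for m in ring]
    start = bisect_left(ids, key) % len(ring)
    count = min(degree, len(ring))
    return [ring[(start + i) % len(ring)] for i in range(count)]


def put(data: bytes, stores: dict, members: list[str], degree: int) -> int:
    """Store `data` on each owner brick and return its content address."""
    key = object_id(data)
    for brick in owners(key, members, degree):
        stores.setdefault(brick, {})[key] = data
    return key


def repair(key: int, stores: dict, members: list[str], degree: int) -> list[str]:
    """Object-driven repair: any one surviving copy on a live brick is
    enough to restore the replication degree under the current membership."""
    live = [m for m in members if key in stores.get(m, {})]
    if not live:
        raise LookupError("no surviving replica of this object")
    data = stores[live[0]][key]
    targets = owners(key, members, degree)
    for brick in targets:
        stores.setdefault(brick, {})[key] = data
    return targets
```

In this sketch, losing all but one replica still allows `repair` to rebuild the full replica set, which mirrors the last-replica recall property the abstract describes. A real system would spread the copy operations for many objects across many bricks at once; the sketch handles a single object sequentially.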
Year: 2007
DOI: 10.1145/1243418.1243423
Venue: Operating Systems Review
Keywords: parallel repair, building block, reference data, content-addressable retention platform, long time, system vulnerability, simple design, data loss, data object, data retention platform, object-driven repair model yield, service provider, p2p, distributed hash table
Field: Peer-to-peer, Data retention, Data loss, Massively parallel, Computer science, Real-time computing, Fault tolerance, Hash function, Distributed computing, Project management, Distributed hash table
DocType: Journal
Volume: 41
Issue: 2
Citations: 18
PageRank: 1.01
References: 31
Authors: 6
Name | Order | Citations | PageRank
Zheng Zhang | 1 | 1193 | 73.82
Qiao Lian | 2 | 235 | 12.64
Shiding Lin | 3 | 257 | 12.28
Wei Chen | 4 | 3416 | 170.71
Yu Chen | 5 | 585 | 41.84
Chao Jin | 6 | 18 | 1.01