Title
Topology-Aware Data Placement Strategy for Fault-Tolerant Storage Systems
Abstract
In distributed storage systems, fault-tolerant methods such as replication or erasure coding are adopted to guarantee data reliability. These methods ensure that data can be recovered through redundancy when a storage node fails. However, this redundancy often incurs nontrivial bandwidth overhead, because large numbers of replicas and blocks must be transmitted. Prior methods focus on reducing the network cost through careful scheduling. In this article, we aim to improve transmission efficiency along an orthogonal dimension, i.e., optimizing the storage locations according to the characteristics of data center networks. We focus on server-centric data centers (such as BCube), where any pair of nodes is interconnected by multiple redundant paths. Thus, transmissions of replicas or blocks can be significantly sped up by using the redundant paths concurrently. Inspired by this insight, we design the node-disjoint storage strategy and the nested node-disjoint storage strategy for the multireplica storage system and the erasure-coded storage system, respectively. Evaluations indicate that our methods save 46.6%-62.1% of the transmission time in the multireplica storage system and 71.5%-80.8% of the transmission time in the erasure-coded storage system, compared with conventional methods adopted in current storage systems.
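The redundant-path idea in the abstract can be illustrated with a minimal sketch. It is not the paper's placement strategy. It assumes the standard BCube addressing, in which each server has one address digit per level and two servers are one hop apart when their addresses differ in exactly one digit. The function names (`digit_correcting_path`, `rotated_paths`, `are_node_disjoint`, `ideal_transfer_time`) are hypothetical, and the timing model is an idealization (equal bandwidth on every path, no contention), so it does not reproduce the evaluation figures quoted above.

```python
from itertools import combinations


def digit_correcting_path(src, dst, order):
    """Walk from src to dst by fixing one address digit per hop, in `order`."""
    path = [tuple(src)]
    cur = list(src)
    for i in order:
        if cur[i] != dst[i]:
            cur[i] = dst[i]          # one hop: change a single digit
            path.append(tuple(cur))
    return path


def rotated_paths(src, dst):
    """Build one path per starting digit, using cyclic rotations of the digit order.

    Assumption: src and dst differ in every digit. Otherwise some rotations
    can coincide and the paths are no longer distinct.
    """
    m = len(src)
    return [
        digit_correcting_path(src, dst, [(start + j) % m for j in range(m)])
        for start in range(m)
    ]


def are_node_disjoint(paths):
    """True if no two paths share an intermediate server."""
    inner = [set(p[1:-1]) for p in paths]
    return all(a.isdisjoint(b) for a, b in combinations(inner, 2))


def ideal_transfer_time(size, bandwidth, num_paths):
    """Idealized time to move `size` units split evenly over `num_paths` paths."""
    return size / (bandwidth * num_paths)


if __name__ == "__main__":
    # Three-digit addresses, e.g. servers in a BCube with three levels.
    paths = rotated_paths((0, 0, 0), (1, 2, 3))
    for p in paths:
        print(p)
    print("node-disjoint:", are_node_disjoint(paths))
    print("single path time:", ideal_transfer_time(300, 100, 1))
    print("three paths time:", ideal_transfer_time(300, 100, len(paths)))
```

Under these assumptions, the three rotated paths share only their endpoints, so a transfer split across them avoids any common intermediate server. Real gains depend on link contention and on where replicas or blocks are placed, which is the question the article addresses.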
Year: 2020
DOI: 10.1109/JSYST.2020.2976720
Venue: IEEE SYSTEMS JOURNAL
Keywords: Data centers, Fault tolerance, Fault tolerant systems, Encoding, Decoding, Data communication, BCube, data center, distributed storage, erasure code, replication
DocType: Journal
Volume: 14
Issue: 3
ISSN: 1932-8184
Citations: 0
PageRank: 0.34
References: 0
Authors: 4
Name          Order  Citations  PageRank
Junxu Xia     1      6          3.10
Deke Guo      2      326        47.25
Lailong Luo   3      18         6.50
Geyao Cheng   4      8          3.50