Abstract |
---|
Existing de-duplication solutions for cloud backup environments either obtain high compression ratios at the cost of heavy de-duplication overheads, in the form of increased latency and reduced throughput, or keep de-duplication overheads small at the cost of low compression ratios that cause high data transmission costs, which results in a large backup window. In this paper, we present SAM, a Semantic-Aware Multi-tiered source de-duplication framework that combines global file-level de-duplication with local chunk-level de-duplication and exploits file semantics at each stage of the framework, to obtain an optimal tradeoff between de-duplication efficiency and de-duplication overhead and ultimately achieve a shorter backup window than existing approaches. Our experimental results with real-world datasets show that SAM not only has a higher de-duplication efficiency/overhead ratio than existing solutions, but also shortens the backup window by an average of 38.7%. |
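
The two-tier idea in the abstract (global file-level de-duplication first, then local chunk-level de-duplication) can be illustrated with a minimal sketch. This is not the SAM implementation: the class name `TwoTierDeduplicator`, the fixed-size chunking, the tiny `CHUNK_SIZE`, and the use of SHA-1 fingerprints are all assumptions made for illustration, and the file-semantics filtering that SAM applies at each stage is left out because the abstract does not describe it in detail.

```python
import hashlib

# Assumption: a tiny fixed chunk size so the example is easy to trace by hand.
CHUNK_SIZE = 4


def fingerprint(data: bytes) -> str:
    """Return a SHA-1 hex digest used as a fingerprint (illustrative choice)."""
    return hashlib.sha1(data).hexdigest()


class TwoTierDeduplicator:
    """Toy source de-duplication: file level first, then chunk level."""

    def __init__(self, global_file_index: set):
        # Whole-file fingerprints known to the backup service, shared by all clients.
        self.global_file_index = global_file_index
        # Chunk fingerprints this client has already sent, kept on the client only.
        self.local_chunk_index = set()

    def backup(self, data: bytes) -> list:
        """Return the chunks that would still have to be transmitted."""
        file_fp = fingerprint(data)
        if file_fp in self.global_file_index:
            return []  # Tier 1: the whole file is already stored somewhere.
        self.global_file_index.add(file_fp)

        to_send = []
        for start in range(0, len(data), CHUNK_SIZE):
            chunk = data[start:start + CHUNK_SIZE]
            chunk_fp = fingerprint(chunk)
            if chunk_fp not in self.local_chunk_index:
                # Tier 2: only chunks this client has not sent before are uploaded.
                self.local_chunk_index.add(chunk_fp)
                to_send.append(chunk)
        return to_send
```

In this sketch, a file that any client has already backed up is skipped entirely by the cheap file-level check, while a modified file only costs the transmission of its changed chunks. A real system would likely use larger or content-defined chunks and persistent indexes.
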
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/ICPP.2010.69 | ICPP |
Keywords | Field | DocType |
global file level deduplication,backup window,compression ratio,semantic-aware multitiered source de-duplication,semantic aware multitiered source deduplication framework,higher de-duplication efficiency,cloud backup,data compression,de-duplication solution,data deduplication,large backup window,de-duplication overhead,heavy de-duplication overhead,file semantics,cloud backup environment,internet,high data transmission cost,local chunk level deduplication,semantic-aware multi-tiered source de-duplication,global file-level de-duplication,small de-duplication overhead,client-server systems,servers,redundancy,indexes,semantics,data transmission | Data deduplication,Computer science,Computer network,Continuous data protection,Throughput,Data compression,Backup software,Backup,Distributed computing,Overhead (business),Cloud computing | Conference |
ISSN | ISBN | Citations |
0190-3918 | 978-0-7695-4156-3 | 33 |
PageRank | References | Authors |
1.36 | 13 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yujuan Tan | 1 | 138 | 23.48 |
Hong Jiang | 2 | 2137 | 157.96 |
Dan Feng | 3 | 1845 | 188.16 |
Lei Tian | 4 | 853 | 39.45 |
Zhichao Yan | 5 | 82 | 10.60 |
Guohui Zhou | 6 | 73 | 29.90 |