Title |
---|
SEALDB: An Efficient LSM-tree based KV Store on SMR Drives with Sets and Dynamic Bands |
Abstract |
---|
Key-value (KV) stores play an increasingly critical role in supporting diverse large-scale applications in modern data centers, which host terabytes of KV items that might even reside on a single server because of virtualization. The combination of the ever-growing volume of KV items and storage/application consolidation is driving a trend toward high storage density for KV stores. Shingled Magnetic Recording (SMR) is a promising technology for increasing disk capacity, but it comes with the added complexity of handling random writes. To take full advantage of SMR drives, applications are expected to work in an SMR-friendly way. In this work, we present SEALDB, a Log-Structured Merge tree (LSM-tree) based key-value store that is specifically optimized for SMR drives by avoiding random writes and the corresponding write amplification on SMR drives. First, for LSM-trees, SEALDB collects and groups the participating data of each compaction into sets. Using a set as the basic unit for compactions, SEALDB improves compaction efficiency by reducing random I/Os. Second, SEALDB creates variable-sized bands, named dynamic bands, on original HM-SMR drives. Dynamic bands store sets in an SMR-friendly way to eliminate the auxiliary write amplification from SMR drives. Third, SEALDB employs two lightweight garbage collection (GC) policies to further improve space efficiency. We demonstrate the advantages of SEALDB via extensive experiments with various workloads. Overall, SEALDB outperforms LevelDB, e.g., it is $3.42\times$/$2.65\times$ faster for random writes (without/with GC, respectively) and $3.96\times$ faster for sequential reads. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/tpds.2019.2918219 | IEEE Transactions on Parallel and Distributed Systems |
Keywords | Field | DocType |
Compaction,Drives,Computer architecture,Magnetic recording,Software,Data centers,Databases | Virtualization,Computer science,Terabyte,Write amplification,Software,Shingled magnetic recording,Garbage collection,Merge (version control),Operating system,Distributed computing | Journal |
Volume | Issue | ISSN |
30 | 11 | 1045-9219 |
Citations | PageRank | References |
2 | 0.37 | 0 |
Authors |
---|
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ting Yao | 1 | 842 | 52.62 |
Zhihu Tan | 2 | 6 | 3.62 |
Jiguang Wan | 3 | 29 | 9.71 |
Ping Huang | 4 | 184 | 29.52 |
Yiwen Zhang | 5 | 28 | 5.81 |
changsheng | 6 | 3 | 2.46 |
Xubin He | 7 | 747 | 63.49 |