Title
YCSB++: benchmarking and performance debugging advanced features in scalable table stores
Abstract
Inspired by Google's BigTable, a variety of scalable, semi-structured, weak-semantic table stores have been developed and optimized for different priorities such as query speed, ingest speed, availability, and interactivity. As these systems mature, performance benchmarking will advance from measuring the rate of simple workloads to understanding and debugging the performance of advanced features such as ingest speed-up techniques and function shipping filters from client to servers. This paper describes YCSB++, a set of extensions to the Yahoo! Cloud Serving Benchmark (YCSB) to improve performance understanding and debugging of these advanced features. YCSB++ includes multi-tester coordination for increased load and eventual consistency measurement, multi-phase workloads to quantify the consequences of work deferment and the benefits of anticipatory configuration optimization such as B-tree pre-splitting or bulk loading, and abstract APIs for explicit incorporation of advanced features in benchmark tests. To enhance performance debugging, we customized an existing cluster monitoring tool to gather the internal statistics of YCSB++, table stores, system services like HDFS, and operating systems, and to offer easy post-test correlation and reporting of performance behaviors. YCSB++ features are illustrated in case studies of two BigTable-like table stores, Apache HBase and Accumulo, developed to emphasize high ingest rates and finegrained security.
Year
DOI
Venue
2011
10.1145/2038916.2038925
SoCC
Keywords
Field
DocType
performance behavior,advanced feature,bigtable-like table store,scalable table store,performance benchmarking,table store,performance understanding,weak-semantic table store,performance debugging,query speed,multi-phase workloads,benchmarking,operating system
Eventual consistency,Interactivity,Computer science,Server,Real-time computing,NoSQL,Operating system,Benchmarking,Database,Debugging,Cloud computing,Scalability
Conference
Citations 
PageRank 
References 
85
3.64
26
Authors
9
Name
Order
Citations
PageRank
Swapnil Patil130618.05
Milo Polte21588.43
Kai Ren322912.85
Wittawat Tantisiriroj41084.98
Lin Xiao51024.97
Julio López616511.33
Garth Gibson725713.77
Adam Fuchs8954.34
Billie Rinaldi9853.64