Title
BIPie: Fast Selection and Aggregation on Encoded Data using Operator Specialization.
Abstract
Advances in modern hardware, such as increases in the size of main memory available on computers, have made it possible to analyze data at a much higher rate than before. In this paper, we demonstrate that there is tremendous room for improvement in the processing of analytical queries on modern commodity hardware. We introduce BIPie, an engine for query processing implementing highly efficient decoding, selection, and aggregation for analytical queries executing on a columnar storage engine in MemSQL. We demonstrate that these operations are interdependent, and must be fused and considered together to achieve very high performance. We propose and compare multiple strategies for decoding, selection and aggregation (with GROUP BY), all of which are designed to take advantage of modern CPU architectures, including SIMD. We implemented these approaches in MemSQL, a high performance hybrid transaction and analytical processing database designed for commodity hardware. We thoroughly evaluate the performance of the approach across a range of parameters, and demonstrate a two to four times speedup over previously published TPC-H Query 1 performance.
Year
DOI
Venue
2018
10.1145/3183713.3190658
SIGMOD/PODS '18: International Conference on Management of Data Houston TX USA June, 2018
Field
DocType
ISSN
Data mining,Computer science,Parallel computing,SIMD,Operator (computer programming),Decoding methods,Database transaction,Commodity hardware,Speedup
Conference
0730-8078
ISBN
Citations 
PageRank 
978-1-4503-4703-7
0
0.34
References 
Authors
16
5
Name
Order
Citations
PageRank
Michal Nowakiewicz1432.05
Eric Boutin2822.66
Eric N. Hanson3917376.11
Robert Walzer421.05
Akash Katipally500.34