Title
A General Analytical Model for Spatial and Temporal Performance of Bitmap Index Compression Algorithms in Big Data
Abstract
Bitmap indexing is flexible to conduct boolean operations in data retrieval. Besides, the query processing based on bitmap indexing is also very fast. Therefore it has been widely used in various big data analytics platforms, such as Druid and Spark etc. However, bitmap index can consume a large amount of memory, which leads to the invention of different kinds of bitmap index compression algorithms without sacrificing temporal performance. In practice, we are often discommoded by choosing a proper algorithm when handling specific problems. Besides, after devising a new algorithm that may outperform existing ones, it is essential to evaluate its performance in theory. Without appropriate theoretical analysis, the deficit of a new algorithm can only be spotted until final experimental results are drawn, thus wasting much time and effort. In this paper, we propose a general analytical model to analyze both the spatial and temporal performance for bitmap index compression algorithms, which can be applied to analyze all kinds of algorithms derived from WAH (word-aligned hybrid). In this model, two types of distributed bitmaps, uniformly distributed bitmaps and clustered bitmaps, are used separately. In order to illustrate this model, several bitmap index compression algorithms are analyzed and compared with each other. Algorithms herein are COMBAT (COMbining Binary And Ternary encoding), SECOMPAX (Scope Extended COMPAX) and CONCISE (Compressed 'n' Composable Integer Set), which are all derived from WAH. Evaluation results by MATLAB simulation about these algorithms are also presented. This paper paves the way for further researches on the performance evaluation of various bitmap index compression algorithms in the future.
Year
DOI
Venue
2015
10.1109/ICCCN.2015.7288362
2015 24th International Conference on Computer Communication and Networks (ICCCN)
Keywords
Field
DocType
general analytical model,bitmap index compression algorithms,Boolean operations,data retrieval,query processing,bitmap indexing,word-aligned hybrid,uniformly distributed bitmaps,clustered bitmaps,COMBAT,combining binary and ternary encoding,SECOMPAX,scope extended COMPAX,CONCISE,compressed 'n' composable integer set
Data mining,Bitmap index,Spark (mathematics),Computer science,Data retrieval,Free space bitmap,Reverse index,Bitmap,Data compression,Encoding (memory)
Conference
ISSN
Citations 
PageRank 
1095-2055
2
0.38
References 
Authors
15
6
Name
Order
Citations
PageRank
Yinjun Wu1114.39
Zhen Chen221836.23
Yuhao Wen3121.75
Junwei Cao493570.95
Wenxun Zheng5142.17
Ge Ma620.38