Title
Fast Online Analytical Processing for Big Data Warehousing
Abstract
In an organizational context where data volume is continuously growing, Online Analytical Processing capabilities are necessary to ensure timely data processing for users that need interactive query processing to support the decision-making process. This paper benchmarks an innovative column-oriented distributed data store, Druid, evaluating its performance in interactive analytical workloads and verifying the impact that different data organizations strategies have in its performance. To achieve this goal, the well-known Star Schema Benchmark is used to verify the impact that the concepts of segments, query granularity and partitions or shards have in the space required to store the data and in the time needed to process it. The obtained results show that scenarios that use partitions usually achieve better processing times, even when that implies an increase in the needed storage space.
Year
DOI
Venue
2018
10.1109/IS.2018.8710583
2018 International Conference on Intelligent Systems (IS)
Keywords
Field
DocType
BigData,Druid,Interactive queries,OLAP
Data processing,Organizational context,Star schema,Computer science,Distributed data store,Granularity,Online analytical processing,Big data,Database
Conference
ISSN
ISBN
Citations 
1541-1672
978-1-5386-7098-9
0
PageRank 
References 
Authors
0.34
4
4
Name
Order
Citations
PageRank
José Higino Correia18018.33
Maribel Yasmina Santos214635.41
Carlos Costa3389.15
Carina Andrade4233.57