Title
Fast Accurate Summary Warehouses with Distributed Summaries
Abstract
Large data warehouses (DW) pose a major challenge for performance and scalability, as users expect instant answers to their queries. Traditional solutions that rely on very expensive architectures and structures cannot answer every complex aggregation query within minutes or seconds. The summary warehouse (SW) achieves such a speedup using only general-purpose sampling summaries that are well suited to aggregated exploration analysis. The major limitation of SWs results from the tradeoff between accuracy and speed: smaller, faster summaries cannot answer less-aggregated queries accurately. We propose a simple and cheap strategy that meets these conflicting requirements and delivers unprecedented speedup by taking advantage of the ubiquity of distributed computation. The distributed summaries (DS) approach proposed in this paper manages a distributed set of summaries placed on the available computing nodes of a local area network to achieve very fast query processing while guaranteeing sufficient accuracy.
Year
2003
DOI
10.1109/IDEAS.2003.1214958
Venue
SEVENTH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS
Keywords
ubiquitous computing,hardware,data warehouse,sampling methods,distributed computing,local area networks,computer networks,data mining,database management systems,computational complexity,online analytical processing,investments,data warehouses,concurrent computing,local area network,scalability,dw,computer architecture
Field
Data warehouse,Data mining,Computer science,Local area network,Concurrent computing,Ubiquitous computing,Online analytical processing,Database,Speedup,Computational complexity theory,Scalability
DocType
Conference
Citations
0
PageRank
0.34
References
11
Authors
2
Name
Order
Citations
PageRank
Pedro Furtado120455.67
João Pedro Costa26411.99