Title
The BigDAWG polystore system and architecture
Abstract
Organizations are often faced with the challenge of providing data management solutions for large, heterogenous datasets that may have different underlying data and programming models. For example, a medical dataset may have unstructured text, relational data, time series waveforms and imagery. Trying to fit such datasets in a single data management system can have adverse performance and efficiency effects. As a part of the Intel Science and Technology Center on Big Data, we are developing a polystore system designed for such problems. BigDAWG (short for the Big Data Analytics Working Group) is a polystore system designed to work on complex problems that naturally span across different processing or storage engines. BigDAWG provides an architecture that supports diverse database systems working with different data models, support for the competing notions of location transparency and semantic completeness via islands and a middleware that provides a uniform multi-island interface. Initial results from a prototype of the BigDAWG system applied to a medical dataset validate polystore concepts. In this article, we will describe polystore databases, the current BigDAWG architecture and its application on the MIMIC II medical dataset, initial performance results and our future development plans.
Year
DOI
Venue
2016
10.1109/HPEC.2016.7761636
2016 IEEE High Performance Extreme Computing Conference (HPEC)
Keywords
DocType
Volume
Big Data analytics working group,BigDAWG polystore system,large heterogenous datasets,data models,programming models,single data management system,intel science and technology center,complex problems,processing engines,storage engines,location transparency,semantic completeness,middleware,uniform multiisland interface,MIMIC II medical dataset
Conference
abs/1609.07548
ISSN
ISBN
Citations 
2377-6943
978-1-5090-3526-7
14
PageRank 
References 
Authors
0.63
19
9
Name
Order
Citations
PageRank
Vijay Gadepally144950.53
Peinan Chen2140.63
Jennie Duggan322912.42
Aaron J. Elmore435234.03
Brandon Haynes5375.77
J. Kepner621515.51
Samuel Madden7161011176.38
Tim Mattson8917.21
Michael Stonebraker9124634310.17