Title
D4M: Bringing associative arrays to database engines
Abstract
The ability to collect and analyze large amounts of data is a growing problem within the scientific community. The growing gap between data and users calls for innovative tools that address the challenges faced by big data volume, velocity and variety. Numerous tools exist that allow users to store, query and index these massive quantities of data. Each storage or database engine comes with the promise of dealing with complex data. Scientists and engineers who wish to use these systems often quickly find that there is no single technology that offers a panacea to the complexity of information. When using multiple technologies, however, there is significant trouble in designing the movement of information between storage and database engines to support an end-to-end application along with a steep learning curve associated with learning the nuances of each underlying technology. In this article, we present the Dynamic Distributed Dimensional Data Model (D4M) as a potential tool to unify database and storage engine operations. Previous articles on D4M have showcased the ability of D4M to interact with the popular NoSQL Accumulo database. Recently however, D4M now operates on a variety of backend storage or database engines while providing a federated look to the end user through the use of associative arrays. In order to showcase how new databases may be supported by D4M, we describe the process of building the D4M-SciDB connector and present performance of this connection.
Year
DOI
Venue
2015
10.1109/HPEC.2015.7322472
2015 IEEE High Performance Extreme Computing Conference (HPEC)
Keywords
Field
DocType
Big Data,Data Analytics,Dimensional Analysis,Federated Databases
Data mining,World Wide Web,Data administration,Database model,Computer science,View,Database design,NoSQL,Database engine,Database theory,Big data,Database
Journal
Volume
ISSN
Citations 
abs/1508.07371
2377-6943
17
PageRank 
References 
Authors
0.89
13
14
Name
Order
Citations
PageRank
Vijay Gadepally144950.53
Jeremy Kepner260661.58
William Arcand317517.77
David Bestor418119.08
Bill Bergeron516816.57
Chansup Byun618019.21
Lauren Edwards7372.62
Matthew Hubbell819220.93
Peter Michaleas920120.93
Julie Mullen1013815.22
Andrew Prout1118218.78
Antonio Rosa1217017.67
Charles Yee1314715.14
Albert Reuther1433537.32