Title
The G* graph database: efficiently managing large distributed dynamic graphs
Abstract
From sensor networks to transportation infrastructure to social networks, we are awash in data. Many of these real-world networks tend to be large (\"big data\") and dynamic, evolving over time. Their evolution can be modeled as a series of graphs. Traditional systems that store and analyze one graph at a time cannot effectively handle the complexity and subtlety inherent in dynamic graphs. Modern analytics require systems capable of storing and processing series of graphs. We present such a system. G* compresses dynamic graph data based on commonalities among the graphs in the series for deduplicated storage on multiple servers. In addition to the obvious space-saving advantage, large-scale graph processing tends to be I/O bound, so faster reads from and writes to stable storage enable faster results. Unlike traditional database and graph processing systems, G* executes complex queries on large graphs using distributed operators to process graph data in parallel. It speeds up queries on multiple graphs by processing graph commonalities only once and sharing the results across relevant graphs. This architecture not only provides scalability, but since G* is not limited to processing only what is available in RAM, its analysis capabilities are far greater than other systems which are limited to what they can hold in memory. This paper presents G*'s design and implementation principles along with evaluation results that document its unique benefits over traditional graph processing systems.
Year
DOI
Venue
2015
10.1007/s10619-014-7140-3
Distributed and Parallel Databases
Keywords
Field
DocType
Graphs,Queries,Distributed databases,Parallel computing,Big data
Graph operations,Process graph,Graph database,Computer science,Server,Theoretical computer science,Distributed database,Big data,Graph (abstract data type),Distributed computing,Scalability
Journal
Volume
Issue
ISSN
33
4
0926-8782
Citations 
PageRank 
References 
15
0.59
37
Authors
7
Name
Order
Citations
PageRank
Alan G. Labouseur1806.28
Jeremy Birnbaum2232.14
Paul W. Olsen3874.66
Sean R. Spillane4221.10
Jayadevan Vijayan5221.43
Jeong-Hyon Hwang6130063.91
Wook-Shin Han780557.85