Abstract | ||
---|---|---|
From sensor networks to transportation infrastructure to social networks, we are awash in data. Many of these real-world networks tend to be large (\"big data\") and dynamic, evolving over time. Their evolution can be modeled as a series of graphs. Traditional systems that store and analyze one graph at a time cannot effectively handle the complexity and subtlety inherent in dynamic graphs. Modern analytics require systems capable of storing and processing series of graphs. We present such a system. G* compresses dynamic graph data based on commonalities among the graphs in the series for deduplicated storage on multiple servers. In addition to the obvious space-saving advantage, large-scale graph processing tends to be I/O bound, so faster reads from and writes to stable storage enable faster results. Unlike traditional database and graph processing systems, G* executes complex queries on large graphs using distributed operators to process graph data in parallel. It speeds up queries on multiple graphs by processing graph commonalities only once and sharing the results across relevant graphs. This architecture not only provides scalability, but since G* is not limited to processing only what is available in RAM, its analysis capabilities are far greater than other systems which are limited to what they can hold in memory. This paper presents G*'s design and implementation principles along with evaluation results that document its unique benefits over traditional graph processing systems. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/s10619-014-7140-3 | Distributed and Parallel Databases |
Keywords | Field | DocType |
Graphs,Queries,Distributed databases,Parallel computing,Big data | Graph operations,Process graph,Graph database,Computer science,Server,Theoretical computer science,Distributed database,Big data,Graph (abstract data type),Distributed computing,Scalability | Journal |
Volume | Issue | ISSN |
33 | 4 | 0926-8782 |
Citations | PageRank | References |
15 | 0.59 | 37 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alan G. Labouseur | 1 | 80 | 6.28 |
Jeremy Birnbaum | 2 | 23 | 2.14 |
Paul W. Olsen | 3 | 87 | 4.66 |
Sean R. Spillane | 4 | 22 | 1.10 |
Jayadevan Vijayan | 5 | 22 | 1.43 |
Jeong-Hyon Hwang | 6 | 1300 | 63.91 |
Wook-Shin Han | 7 | 805 | 57.85 |