Title
Graphine: Programming Graph-Parallel Computation of Large Natural Graphs on Multicore Cluster
Abstract
Graph-parallel computation has become a crucial component of emerging applications in web search, data analytics, and machine learning. In practice, most graphs derived from real-world phenomena are very large and scale-free. Unfortunately, distributed graph-parallel computation on these natural graphs still suffers from strong scalability issues on contemporary multicore clusters. To embrace the multicore architecture in distributed graph-parallel computation, we propose the Graphine framework, which features (i) a Scatter-Combine computation abstraction, evolved from the traditional vertex-centric approach by fusing the paired scatter and gather operations, which are executed separately on the two sides of an edge, into a one-sided scatter; coupled with an active message mechanism, it potentially reduces intermediate message cost and enables fine-grained parallelism on multicore architectures; and (ii) an Agent-Graph data model, which leverages an idea similar to vertex-cut but conceptually splits the remote replica into two agent types, scatter and combiner, resulting in less communication. We implement the Graphine framework and evaluate it using several representative algorithms on six large real-world graphs and a series of synthetic graphs with power-law degree distributions. We show that Graphine achieves sublinear scalability with the number of cores per node, the number of nodes, and graph size (up to one billion vertices), and is 2 to 15 times faster than the state-of-the-art PowerGraph on a cluster of 16 multicore nodes.
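The one-sided scatter idea in the abstract can be illustrated with a minimal single-process sketch. This is not Graphine's API: the function name `scatter_combine_pagerank`, its parameters, and the choice of PageRank as the example algorithm are all assumptions made for illustration. Each edge's source pushes a contribution to its destination, where a commutative sum folds it into an accumulator immediately, so no separate gather pass over in-edges is needed.

```python
from collections import defaultdict


def scatter_combine_pagerank(edges, num_vertices, damping=0.85, iterations=20):
    """Hypothetical sketch of a scatter-combine style PageRank.

    edges is a list of (src, dst) pairs with vertex ids in [0, num_vertices).
    Assumes every vertex has at least one out-edge (no dangling vertices).
    """
    out_degree = defaultdict(int)
    for src, _ in edges:
        out_degree[src] += 1

    rank = [1.0 / num_vertices] * num_vertices
    for _ in range(iterations):
        # One accumulator per destination vertex, standing in for a combiner.
        combined = [0.0] * num_vertices
        for src, dst in edges:
            # Scatter: the source side computes the message for this edge.
            message = rank[src] / out_degree[src]
            # Combine: the destination folds the message in with a sum,
            # instead of storing it for a later gather phase.
            combined[dst] += message
        # Apply: each vertex updates its value from its combined total.
        rank = [
            (1.0 - damping) / num_vertices + damping * combined[v]
            for v in range(num_vertices)
        ]
    return rank
```

For example, `scatter_combine_pagerank([(0, 1), (1, 2), (2, 0)], 3)` runs the sketch on a three-vertex cycle. In a distributed setting the combine step would presumably run where the destination's data lives, which is where the active message mechanism mentioned in the abstract would come in; that part is not modeled here.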
Year: 2016
DOI: 10.1109/TPDS.2015.2453978
Venue: IEEE Transactions on Parallel and Distributed Systems
Keywords: Computational Model, Graph-Parallel, Parallel Framework
Field: Replica, Data modeling, Data analysis, Computer science, Parallel computing, Active message, Multi-core processor, Data model, Scalability, Distributed computing, Computation
DocType: Journal
Volume: PP
Issue: 99
ISSN: 1045-9219
Citations: 2
PageRank: 0.36
References: 25
Authors: 4
1. Jie Yan (Citations: 4, PageRank: 2.08)
2. Guangming Tan (Citations: 436, PageRank: 48.90)
3. Zeyao Mo (Citations: 73, PageRank: 19.48)
4. SUN Ning-Hui (Citations: 1268, PageRank: 97.37)