Title
Modeling, analysis, and experimental comparison of streaming graph-partitioning policies.
Abstract
In recent years, many distributed graph-processing systems have been designed and developed to analyze large-scale graphs. For all distributed graph-processing systems, partitioning graphs is a key part of processing and an important aspect to achieve good processing performance. To keep low the overhead of partitioning graphs, even when processing the ever-increasing modern graphs, many previous studies use lightweight streaming graph-partitioning policies. Although many such policies exist, currently there is no comprehensive study of their impact on load balancing and communication overheads, and on the overall performance of graph-processing systems. This relative lack of understanding hampers the development and tuning of new streaming policies, and could limit the entire research community to the existing classes of policies. We address these issues in this work. We begin by modeling the execution time of distributed graph-processing systems. By analyzing this model under the load of realistic graph-data characteristics, we propose a method to identify important performance issues and then design new streaming graph-partitioning policies to address them. By using three typical large-scale graphs and three popular graph-processing algorithms, we conduct comprehensive experiments to study the performance of our and of many alternative streaming policies on a real distributed graph-processing system. We also explore the impact on performance of using different real-world networks and of other real-world technical details. We further discuss how to use our results, the coverage of our model and method, and the design of future partitioning policies.
Year
DOI
Venue
2017
10.1016/j.jpdc.2016.02.003
Journal of Parallel and Distributed Computing
Keywords
Field
DocType
Streaming graph-partitioning policies,Large-scale graphs,Graph-processing systems,Modeling analysis,Performance evaluation
Graph,Computer science,Load balancing (computing),Parallel computing,Execution time,Graph partition,Overhead (business),Distributed computing
Journal
Volume
ISSN
Citations 
108
0743-7315
6
PageRank 
References 
Authors
0.42
25
5
Name
Order
Citations
PageRank
Yong Guo1954.60
Sungpack Hong286433.20
Hassan Chafi3111861.11
Alexandru Iosup42042125.89
Dick H. J. Epema53134180.80