Title
Optimal Load Allocation for Coded Distributed Computation in Heterogeneous Clusters
Abstract
Recently, coding has been a useful technique to mitigate stragglers' effect in distributed computing. However, coding in this context has been mainly explored assuming homogeneous workers, although real-world clusters often consist of heterogeneous workers with different computing capabilities. The uniform load allocation without considering the heterogeneity possibly causes a significant loss in latency. In this article, we suggest the optimal load allocation for coded distributed computing with heterogeneous workers. Specifically, we focus on the scenario that there exist workers having the same computing capability, which can be regarded as a group for analysis. We rely on the lower bound on the expected latency and obtain the optimal load allocation by showing that our load allocation achieves the minimum of the lower bound for a sufficiently large number of workers. Given the proposed optimal load allocation, we derive the optimal code rate to achieve the minimum expected latency. From numerical simulations, when assuming the group heterogeneity, our load allocation reduces the expected latency by orders of magnitude over the existing scheme. Furthermore, from experiments on Amazon EC2 for scenarios with distinct straggler/heterogeneity patterns, we observe that our scheme outperforms the competing schemes reducing the total finishing time by up to 52%.
Year
DOI
Venue
2019
10.1109/TCOMM.2020.3030667
IEEE Transactions on Communications
Keywords
DocType
Volume
Coded distributed computing,heterogeneous clusters,optimal load allocation
Journal
69
Issue
ISSN
Citations 
1
0090-6778
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
D. Kim128535.51
Hyegyeong Park200.34
Jun-Kyun Choi317543.94