Abstract | ||
---|---|---|
As the data becomes bigger and more complex, people tend to process it in a distributed system implemented on clusters. Due to the power consumption, cost, and differentiated price-performance, the clusters are evolving into the system with heterogeneous hardware leading to the performance difference among the nodes. Even in a homogeneous cluster, the performance of the nodes is different due to the resource competition and the communication cost. Some nodes with poor performance will drag down the efficiency of the whole system. Existing parallel computing strategies such as bulk synchronous parallel strategy and stale synchronous parallel strategy are not well suited to this problem. To address it, we proposed a free stale synchronous parallel (FSSP) strategy to free the system from the negative impact of those nodes. FSSP is improved from stale synchronous parallel (SSP) strategy, which can effectively and accurately figure out the slow nodes and eliminate the negative effects of those nodes. We validated the performance of the FSSP strategy by using some classical machine learning algorithms and datasets. Our experimental results demonstrated that FSSP was 1.5-12x faster than the bulk synchronous parallel strategy and stale synchronous parallel strategy, and it used 4x fewer iterations than the asynchronous parallel strategy to converge. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/ACCESS.2019.2936820 | IEEE ACCESS |
Keywords | DocType | Volume |
Straggler, parallel strategy, parallel programming | Journal | 7 |
ISSN | Citations | PageRank |
2169-3536 | 0 | 0.34 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hang Shi | 1 | 0 | 0.68 |
Yue Zhao | 2 | 58 | 28.59 |
Bofeng Zhang | 3 | 179 | 41.38 |
Kenji Yoshigoe | 4 | 84 | 13.88 |
Furong Chang | 5 | 0 | 1.69 |