Effective Parallel Computing via a Free Stale Synchronous Parallel Strategy

As the data becomes bigger and more complex, people tend to process it in a distributed system implemented on clusters. Due to the power consumption, cost, and differentiated price-performance, the clusters are evolving into the system with heterogeneous hardware leading to the performance differenc...

Full description

Saved in:
Bibliographic Details
Main Authors: Hang Shi, Yue Zhao, Bofeng Zhang, Kenji Yoshigoe, Furong Chang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8811465/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:As the data becomes bigger and more complex, people tend to process it in a distributed system implemented on clusters. Due to the power consumption, cost, and differentiated price-performance, the clusters are evolving into the system with heterogeneous hardware leading to the performance difference among the nodes. Even in a homogeneous cluster, the performance of the nodes is different due to the resource competition and the communication cost. Some nodes with poor performance will drag down the efficiency of the whole system. Existing parallel computing strategies such as bulk synchronous parallel strategy and stale synchronous parallel strategy are not well suited to this problem. To address it, we proposed a free stale synchronous parallel (FSSP) strategy to free the system from the negative impact of those nodes. FSSP is improved from stale synchronous parallel (SSP) strategy, which can effectively and accurately figure out the slow nodes and eliminate the negative effects of those nodes. We validated the performance of the FSSP strategy by using some classical machine learning algorithms and datasets. Our experimental results demonstrated that FSSP was 1.5-<inline-formula> <tex-math notation="LaTeX">$12\times $ </tex-math></inline-formula> faster than the bulk synchronous parallel strategy and stale synchronous parallel strategy, and it used <inline-formula> <tex-math notation="LaTeX">$4\times $ </tex-math></inline-formula> fewer iterations than the asynchronous parallel strategy to converge.
ISSN:2169-3536