Title
Seflow: Efficient Flow Scheduling for Data-Parallel Jobs
Abstract
Data-parallel jobs transfer massive amounts of data between a series of successive stages. The coflow abstraction is proposed to represent a group of parallel flows between two stages and efficiently improves stage-level performance. However, state-of-the-art coflow scheduling techniques are agnostic to the jobs' inter-coflow semantics and thus are suboptimal in reducing the average job completion times (JCT). To address this problem, in this paper we present the "semantic flow" (seflow) abstraction to express the job-level intercoflow semantics. A seflow comprises not only all the coflows of a job but also the relationship between the coflows. We design an efficient seflow scheduler which utilizes the rich seflow semantics of jobs to achieve better performance than seflow-agnostic scheduling for data-parallel jobs.
Year
DOI
Venue
2017
10.1109/ICDCSW.2017.40
2017 IEEE 37th International Conference on Distributed Computing Systems Workshops (ICDCSW)
Keywords
Field
DocType
data-parallel jobs,efficient flow scheduling,seflow,parallel flows,coflow scheduling techniques,average job completion times,JCT,intercoflow semantics,semantic flow abstraction,job-level intercoflow semantics,seflow-agnostic scheduling
Abstraction,Scheduling (computing),Computer science,Parallel computing,Flow scheduling,Bandwidth (signal processing),Processor scheduling,Semantics,Distributed computing
Conference
ISSN
ISBN
Citations 
1545-0678
978-1-5386-3293-2
0
PageRank 
References 
Authors
0.34
8
5
Name
Order
Citations
PageRank
Qiao Zhou101.01
Ziyang Li2404.74
Ping Zhong34011.34
Tian Tian48618.09
Yuxing Peng519445.66