Title
Fairness in dataflow scheduling in the cloud.
Abstract
Expensive dataflow queries which may involve large-scale computations operating on significant volumes of data are typically executed on distributed platforms to improve application performance. Among these, cloud computing has emerged as an attractive option for users to execute dataflows allowing them to select proper configurations (e.g., number of machines) to achieve desired trade-offs between execution time and monetary cost. Discovering dataflow schedules that exhibit the best trade-offs within a plethora of potential solutions can be challenging, especially in a heterogeneous environment where resource characteristics like performance and price can be varied. To increase resource utilization, users may also submit multiple dataflows for execution concurrently. Traditionally, building fair schedules (schedules where the slowdown of all dataflows due to resource sharing is similar) while achieving good performance is a major concern. However, considering fairness in the cloud computing setting where monetary cost is part of the optimization objectives significantly increases the difficulty of the scheduling problem. This paper proposes an algorithm for the scheduling of multiple dataflows on heterogeneous clouds that identifies Pareto-optimal solutions (schedules) in the three-dimensional space formed from the different trade-offs between overall execution time, monetary cost and fairness. The results show that in most cases the proposed approach can provide solutions with fairer schedules without significantly impacting the quality of the execution time to monetary cost skyline compared to the state of the art where the fairness of a solution is not taken into account.
Year
DOI
Venue
2019
10.1016/j.is.2019.03.003
Information Systems
Keywords
Field
DocType
Cloud computing,Multiple dataflows,Fairness,Dataflow scheduling
Skyline,Job shop scheduling,Scheduling (computing),Computer science,Dataflow,Schedule,Shared resource,Database,Cloud computing,Computation,Distributed computing
Journal
Volume
ISSN
Citations 
83
0306-4379
1
PageRank 
References 
Authors
0.35
0
3
Name
Order
Citations
PageRank
Ilia Pietri1847.94
Yannis Chronis231.72
Yannis E. Ioannidis354971988.40