Title
ExpREsS: EneRgy Efficient Scheduling of Mixed Stream and Batch Processing Workloads
Abstract
Nowadays we see the wide adoption of novel distributed processing frameworks such as Apache Spark for handling batch and stream processing big data applications. An important aspect that has not been examined in these systems is their energy consumption during the application execution. Reducing the power consumption of modern datacenters is a necessity as datacenters contribute over 2% of the total US electric usage. One way of addressing this energy issue is by scheduling the applications in an energy-efficient way. However, efficiently scheduling applications can be challenging as we need to consider the trade-off between the datacenter's energy usage and per application performance requirements. In this work we propose, ExpREsS, a scheduler for orchestrating the execution of Spark applications so that it both minimizes the energy consumption and satisfies the applications' performance requirements. Our approach exploits time-series prediction models for capturing the applications' energy usage and execution times, and then applies a novel DVFS technique to minimize the energy consumption. Our detailed experimental evaluation using realistic workloads on our local cluster illustrates the working and benefits of our approach.
Year
DOI
Venue
2017
10.1109/ICAC.2017.43
2017 IEEE International Conference on Autonomic Computing (ICAC)
Keywords
Field
DocType
Distributed Systems,Scheduling,Green Computing
Spark (mathematics),Computer science,Scheduling (computing),Energy efficient scheduling,Real-time computing,Exploit,Batch processing,Stream processing,Energy consumption,Big data,Embedded system,Distributed computing
Conference
ISBN
Citations 
PageRank 
978-1-5386-1763-2
2
0.36
References 
Authors
10
3
Name
Order
Citations
PageRank
Stathis Maroulis152.07
Nikos Zacheilas2799.40
Vana Kalogeraki31686124.40