Abstract | ||
---|---|---|
Nowadays we see the wide adoption of novel distributed processing frameworks such as Apache Spark for handling batch and stream processing big data applications. An important aspect that has not been examined in these systems is their energy consumption during the application execution. Reducing the power consumption of modern datacenters is a necessity as datacenters contribute over 2% of the total US electric usage. One way of addressing this energy issue is by scheduling the applications in an energy-efficient way. However, efficiently scheduling applications can be challenging as we need to consider the trade-off between the datacenter's energy usage and per application performance requirements. In this work we propose, ExpREsS, a scheduler for orchestrating the execution of Spark applications so that it both minimizes the energy consumption and satisfies the applications' performance requirements. Our approach exploits time-series prediction models for capturing the applications' energy usage and execution times, and then applies a novel DVFS technique to minimize the energy consumption. Our detailed experimental evaluation using realistic workloads on our local cluster illustrates the working and benefits of our approach. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1109/ICAC.2017.43 | 2017 IEEE International Conference on Autonomic Computing (ICAC) |
Keywords | Field | DocType |
Distributed Systems,Scheduling,Green Computing | Spark (mathematics),Computer science,Scheduling (computing),Energy efficient scheduling,Real-time computing,Exploit,Batch processing,Stream processing,Energy consumption,Big data,Embedded system,Distributed computing | Conference |
ISBN | Citations | PageRank |
978-1-5386-1763-2 | 2 | 0.36 |
References | Authors | |
10 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stathis Maroulis | 1 | 5 | 2.07 |
Nikos Zacheilas | 2 | 79 | 9.40 |
Vana Kalogeraki | 3 | 1686 | 124.40 |