Title
DeepEE: Joint Optimization of Job Scheduling and Cooling Control for Data Center Energy Efficiency Using Deep Reinforcement Learning
Abstract
The past decade witnessed the tremendous growth of power consumption in data centers due to the rapid development of cloud computing, big data analytics, and machine learning, etc. The prior approaches that optimize the power consumption of the information technology (IT) system and/or the cooling system always fail to capture the system dynamics or suffer from the complexity of system states and action spaces. In this paper, we propose a Deep Reinforcement Learning (DRL) based optimization framework, named DeepEE, to improve the energy efficiency for data centers by considering the IT and cooling systems concurrently. In DeepEE, we first propose a PArameterized action space based Deep Q-Network (PADQN) algorithm to solve the hybrid action space problem and jointly optimize the job scheduling for the IT system and the airflow rate adjustment for the cooling system. Then, a two-time-scale control mechanism is applied in PADQN to coordinate the IT and cooling systems more accurately and efficiently. In addition, to train and evaluate the proposed PADQN in a safe and quick way, we build a simulation platform to model the dynamics of IT workload and cooling systems simultaneously. Through extensive real-trace based simulations, we demonstrate that: 1) our algorithm can save up to 15% and 10% energy consumption in comparison with the baseline siloed and joint optimization approaches respectively; 2) our algorithm achieves more stable performance gain in terms of power consumption by adopting the parameterized action space; and 3) our algorithm leads to a better tradeoff between energy saving and service quality.
Year
DOI
Venue
2019
10.1109/ICDCS.2019.00070
2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS)
Keywords
Field
DocType
Deep reinforcement learning,Data center,Energy efficiency,Job scheduling,Cooling control
Efficient energy use,Computer science,System dynamics,Job scheduler,Water cooling,Data center,Energy consumption,Reinforcement learning,Distributed computing,Cloud computing
Conference
ISSN
ISBN
Citations 
1063-6927
978-1-7281-2520-6
2
PageRank 
References 
Authors
0.37
0
4
Name
Order
Citations
PageRank
Yongyi Ran120.71
Han Hu29311.20
Xin Zhou312615.50
Yonggang Wen42512156.47