Title
Spatio-temporal thermal-aware scheduling for homogeneous high-performance computing datacenters.
Abstract
Datacenters have become an important part of todays computing infrastructure. Recent studies have shown the increasing importance of thermal considerations to achieve effective resource management. In this paper, we study thermal-aware scheduling for homogeneous high-performance computing (HPC) datacenters under a thermal model that captures both spatial and temporal correlations of the temperature evolution. We propose an online scheduling heuristic to minimize the makespan for a set of HPC applications subject to a thermal constraint. The heuristic leverages the novel notion of thermal-aware load to perform both job assignment and thermal management. To respect the temperature constraint, which is governed by a complex spatio-temporal thermal correlation, dynamic voltage and frequency scaling (DVFS) is used to regulate the job executions during runtime while dynamically balancing the loads of the servers to improve makespan. Extensive simulations are conducted based on an experimentally validated datacenter configuration and realistic parameter settings. The results show improved performance of the proposed heuristic compared to existing solutions in the literature, and demonstrate the importance of both spatial and temporal considerations. In contrast to some scheduling problems, where DVFS introduces performanceenergy tradeoffs, our findings reveal the benefit of applying DVFS with both performance and energy gains in the context of spatio-temporal thermal-aware scheduling. Thermal model capturing both spatial and temporal temperature correlations in datacenters.Formulation of a spatio-temporal thermal-aware scheduling problem for HPC applications.Scheduling heuristic using thermal-aware load for job assignment and thermal management.Simulations to show the effectiveness of heuristic under a wide range of parameters.
Year
DOI
Venue
2017
10.1016/j.future.2017.02.005
Future Generation Comp. Syst.
Keywords
Field
DocType
HPC datacenters,Thermal model,Spatio-temporal correlation,Thermal-aware scheduling,Makespan,Energy consumption,DVFS
Resource management,Heuristic,Job shop scheduling,Supercomputer,Scheduling (computing),Computer science,Server,Real-time computing,Frequency scaling,Energy consumption,Distributed computing
Journal
Volume
Issue
ISSN
71
C
0167-739X
Citations 
PageRank 
References 
8
0.47
35
Authors
3
Name
Order
Citations
PageRank
Hongyang Sun112316.35
Patricia Stolf212517.37
Jean-Marc Pierson362359.06