Abstract | ||
---|---|---|
The aim of this paper is to illustrate the use of application and system level logs to better understand scientific data center behavior and energy-spending. Analyzing a data center log of 900 nodes (Sandy Bridge and Haswell), we study node power consumption and describe approaches to estimate and forecast it. Our results include methods to cluster nodes based on different vmstat and RAPL measurements as well as Gaussian and GAM models for estimating the plug power consumption. We also analyze failed jobs and find that non-successfully terminated jobs consume around 40% of computing time. While the actual numbers are likely to vary in different data centers at different times, the purpose of the paper is to share ideas of what can be found by statistical and machine learning analysis of large amount of log data. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1007/s00450-018-0394-7 | Computer Science - Research and Development |
Keywords | Field | DocType |
RAPL, Energy modeling, Energy efficiency, Data center log analysis | Energy modeling,Efficient energy use,Computer science,Real-time computing,Gaussian,Data center,System level,Power consumption | Journal |
Volume | Issue | ISSN |
34 | 1 | 2524-8529 |
Citations | PageRank | References |
0 | 0.34 | 8 |
Authors | ||
8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kashif Nizam Khan | 1 | 57 | 6.09 |
Kashif Nizam Khan | 2 | 57 | 6.09 |
Sanja Scepanovic | 3 | 5 | 2.79 |
Tapio Niemi | 4 | 163 | 18.90 |
Jukka K. Nurminen | 5 | 649 | 59.58 |
Jukka K. Nurminen | 6 | 649 | 59.58 |
Sebastian von Alfthan | 7 | 3 | 1.81 |
Olli-Pekka Lehto | 8 | 0 | 0.34 |