Title
Two-level task scheduling with multi-objectives in geo-distributed and large-scale SaaS cloud
Abstract
With the exploding of data-intensive web applications and requests (tasks), geo-distributed and large-scale data centers (DCs) are widely deployed in Software as a Service (SaaS) cloud, but server failures continue to grow at the same time. In this context, task scheduling problems become more intricate and both scheduling quality and scheduling speed raise further concerns. In this paper, we first propose a virtualized & monitoring SaaS model with predictive maintenance to minimize the costs of fault tolerance. Then with the monitored and predicted available states of servers, we focus on dynamic real-time task scheduling in geo-distributed and large-scale DCs with heterogeneous servers. Multiple objectives, including the long-term performance benefits, energy and communication costs, are taken into consideration in order to improve scheduling quality. For inter-DC and intra-DC task scheduling, two dynamic programming problems are formulated respectively, but there exists the problem that both state and action spaces are too large to be solved by simple iterations. To address this issue, we introduce the idea of reinforcement learning theory into solving traditional stochastic dynamic programming problems in the large-scale SaaS cloud, and put forward a cascaded two-level (inter-DC and intra-DC level) approximate dynamic programming (ADP) task-scheduling algorithm. The computation complexity can be significantly reduced and scheduling speed can be greatly improved. Finally, we conduct experiments with both random simulation data and Google cloud trace-logs. QoS evaluations and comparisons demonstrate that two ADP algorithms can work cooperatively, and our two-level ADP algorithm is more effective under large quantity of bursty requests.
Year
DOI
Venue
2019
10.1007/s11280-019-00680-2
World Wide Web
Keywords
Field
DocType
Multi-objective optimization, SaaS cloud, Data center, Task scheduling, Approximate dynamic programming
Data mining,Computer science,Scheduling (computing),Server,Multi-objective optimization,Software as a service,Data center,Stochastic programming,Cloud computing,Reinforcement learning,Distributed computing
Journal
Volume
Issue
ISSN
22
6
1573-1413
Citations 
PageRank 
References 
0
0.34
0
Authors
5
Name
Order
Citations
PageRank
Puheng Zhang100.34
Xiao Ma200.34
Yanping Xiao300.34
Wenzhuo Li4315.70
Chuang Lin53040390.74