The power of prediction: microservice auto scaling via workload learning - Citegraph

Paper Info

Title
The power of prediction: microservice auto scaling via workload learning

Abstract
When deploying microservices in production clusters, it is critical to automatically scale containers to improve cluster utilization and ensure service level agreements (SLAs). Although reactive scaling approaches work well for monolithic architectures, they are not necessarily suitable for microservice frameworks due to the long delay caused by complex microservice call chains. In contrast, existing proactive approaches leverage end-to-end performance prediction for scaling, but cannot effectively handle microservice multiplexing and dynamic microservice dependencies. In this paper, we present Madu, a proactive microservice auto-scaler that scales containers based on predictions for individual microservices. Madu learns workload uncertainty to handle the highly dynamic dependency between microservices. Additionally, Madu adopts OS-level metrics to optimize resource usage while maintaining good control over scaling overhead. Experiments on large-scale deployments of microservices in Alibaba clusters show that the overall prediction accuracy of Madu can reach as high as 92.3% on average, which is 13% higher than the state-of-the-art approaches. Furthermore, experiments running real-world microservice benchmarks in a local cluster of 20 servers show that Madu can reduce the overall resource usage by 1.7X compared to reactive solutions, while reducing end-to-end service latency by 50%.

Year	2022
DOI	10.1145/3542929.3563477
Venue	International Conference on Management of Data
DocType	Conference
Citations	0
PageRank	0.34
References	0
Authors	7

Authors (7 rows)

Name	Order	Citations	PageRank
Shutian Luo	1	0	0.34
Huanle Xu	2	0	0.34
Kejiang Ye	3	0	0.34
Guoyao Xu	4	0	0.34
Liping Zhang	5	7	1.16
Guodong Yang	6	0	0.34
Chengzhong Xu	7	0	0.34
