Title
Caladrius: A Performance Modelling Service for Distributed Stream Processing Systems
Abstract
Real-time stream processing has become increasingly important in recent years and has led to the development of a multitude of stream processing systems. Given the varying job workloads that characterize stream processing, these systems need to be tuned and adjusted to maintain performance targets in the face of variation in incoming traffic. Current auto-scaling systems adopt a series of trials to approach a job's expected performance due to a lack of performance modelling tools. We find that general traffic trends in most jobs lend themselves well to prediction. Based on this premise, we built a system called Caladrius that forecasts the future traffic load of a stream processing job and predicts its processing performance after a proposed change to the parallelism of its operators. Experimental results show that Caladrius is able to estimate a job's throughput performance and CPU load under a given scaling configuration.
Year
DOI
Venue
2019
10.1109/ICDE.2019.00204
2019 IEEE 35th International Conference on Data Engineering (ICDE)
Keywords
Field
DocType
Topology,Throughput,Containers,Measurement,Predictive models,Parallel processing,Analytical models
Data mining,Traffic load,Computer science,Parallel processing,Operator (computer programming),Throughput,Cpu load,Stream processing,Scaling,Distributed computing
Conference
ISSN
ISBN
Citations 
1084-4627
978-1-5386-7474-1
1
PageRank 
References 
Authors
0.38
0
14
Name
Order
Citations
PageRank
Faria Kalim142.21
Thomas Cooper230.76
Huijun Wu383.19
Yao Li412.41
Ning Wang523087.46
Neng Lu610.38
Maosong Fu72698.98
Xiaoyao Qian810.38
Hao Luo9187.77
Da Cheng1010.38
Yaliang Wang1111.06
Fred Dai1210.38
Mainak Ghosh13634.63
Beinan Wang1412.07