Title
Bellamy: Reusing Performance Models for Distributed Dataflow Jobs Across Contexts
Abstract
Distributed dataflow systems enable the use of clusters for scalable data analytics. However, selecting appropriate cluster resources for a processing job is often not straightforward. Performance models trained on historical executions of a concrete job are helpful in such situations, yet they are usually bound to a specific job execution context (e.g. node type, software versions, job parameters...
Year
DOI
Venue
2021
10.1109/Cluster48925.2021.00052
2021 IEEE International Conference on Cluster Computing (CLUSTER)
Keywords
DocType
ISSN
Analytical models,Runtime,Computational modeling,Clustering algorithms,Predictive models,Prediction algorithms,Data models
Conference
1552-5244
ISBN
Citations 
PageRank 
978-1-7281-9666-4
0
0.34
References 
Authors
0
7
Name
Order
Citations
PageRank
Dominik Scheinert121.74
Lauritz Thamsen2439.26
Houkun Zhu320.73
Jonathan Will423.09
Alexander Acker5114.04
Thorsten Wittkopp601.35
Odej Kao7106696.19