Title
Data Driven Priority Scheduling on Spark Based Stream Processing
Abstract
This paper focuses on priority based processing of streaming data. One of the greatest challenges in big data analytics is responding to a bursty input load. The common solutions are to use dynamic resource provisioning techniques, however, these techniques may not respond quickly enough to the change in the load. Another option is to overprovision, but this results in wasted computing resources. This paper describes a technique that can be used in cases where resources are statically provisioned. This technique enables users to prioritize certain input data items so that in cases where the load suddenly increases, the high priority items are given precedence over low priority items. This technique is implemented on the Spark Streaming engine.
Year
DOI
Venue
2018
10.1109/BDCAT.2018.00034
2018 IEEE/ACM 5th International Conference on Big Data Computing Applications and Technologies (BDCAT)
Keywords
Field
DocType
Spark, Spark Streaming, priority scheduling
Data mining,Spark (mathematics),Data-driven,Computer science,Provisioning,Streaming data,Priority scheduling,Stream processing,Big data,Distributed computing
Conference
ISBN
Citations 
PageRank 
978-1-5386-5503-0
0
0.34
References 
Authors
0
2
Name
Order
Citations
PageRank
Tobi Ajila100.68
Shikharesh Majumdar243575.95