Title
NexusDS: a flexible and extensible middleware for distributed stream processing
Abstract
Techniques for efficient and distributed processing of huge, unbound data streams have made some impact in the database community. Sensors and data sources, such as position data of moving objects, continuously produce data that is consumed, e.g., by location-aware applications. Depending on the domain of interest, e.g. visualization, the processing of such data often depends on domain-specific functionality. This functionality is specified in terms of dedicated operators that may require specialized hardware, e.g. GPUs. This creates a strong dependency which a data stream processing system must consider when deploying such operators. Many data stream processing systems have been presented so far. However, these systems assume homogeneous computing nodes, do not consider operator deployment constraints, and are not designed to address domain-specific needs. In this paper, we identify necessary features that a flexible and extensible middleware for distributed stream processing of context data must satisfy. We present NexusDS, our approach to achieve these requirements. In NexusDS, data processing is specified by orchestrating data flow graphs, which are modeled as processing pipelines of predefined and general operators as well as custom-built and domain-specific ones. We focus on easy extensibility and support for domain-specific operators and services that may even utilize specific hardware available on dedicated computing nodes.
Year
DOI
Venue
2009
10.1145/1620432.1620448
IDEAS
Keywords
Field
DocType
domain-specific need,unbound data stream,extensible middleware,stream processing,context data,domain-specific functionality,data processing,data flow graph,position data,data source,data stream processing system,p2p,data management,satisfiability,middleware,distributed processing
Middleware,Data mining,Data processing,Data stream mining,Computer science,Visualization,Operator (computer programming),Stream processing,Extensibility,Database,Data flow diagram,Distributed computing
Conference
Citations 
PageRank 
References 
9
0.65
22
Authors
5
Name
Order
Citations
PageRank
Nazario Cipriani1497.99
Mike Eissele2314.09
Andreas Brodt311010.91
Matthias Grossmann418115.01
Bernhard Mitschang5928284.89