Abstract | ||
---|---|---|
Techniques for efficient and distributed processing of huge, unbound data streams have made some impact in the database community. Sensors and data sources, such as position data of moving objects, continuously produce data that is consumed, e.g., by location-aware applications. Depending on the domain of interest, e.g. visualization, the processing of such data often depends on domain-specific functionality. This functionality is specified in terms of dedicated operators that may require specialized hardware, e.g. GPUs. This creates a strong dependency which a data stream processing system must consider when deploying such operators. Many data stream processing systems have been presented so far. However, these systems assume homogeneous computing nodes, do not consider operator deployment constraints, and are not designed to address domain-specific needs. In this paper, we identify necessary features that a flexible and extensible middleware for distributed stream processing of context data must satisfy. We present NexusDS, our approach to achieve these requirements. In NexusDS, data processing is specified by orchestrating data flow graphs, which are modeled as processing pipelines of predefined and general operators as well as custom-built and domain-specific ones. We focus on easy extensibility and support for domain-specific operators and services that may even utilize specific hardware available on dedicated computing nodes. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1145/1620432.1620448 | IDEAS |
Keywords | Field | DocType |
domain-specific need,unbound data stream,extensible middleware,stream processing,context data,domain-specific functionality,data processing,data flow graph,position data,data source,data stream processing system,p2p,data management,satisfiability,middleware,distributed processing | Middleware,Data mining,Data processing,Data stream mining,Computer science,Visualization,Operator (computer programming),Stream processing,Extensibility,Database,Data flow diagram,Distributed computing | Conference |
Citations | PageRank | References |
9 | 0.65 | 22 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nazario Cipriani | 1 | 49 | 7.99 |
Mike Eissele | 2 | 31 | 4.09 |
Andreas Brodt | 3 | 110 | 10.91 |
Matthias Grossmann | 4 | 181 | 15.01 |
Bernhard Mitschang | 5 | 928 | 284.89 |