Title
Ontology-Based Data Quality Management for Data Streams.
Abstract
Data Stream Management Systems (DSMS) provide real-time data processing in an effective way, but there is always a tradeoff between data quality (DQ) and performance. We propose an ontology-based data quality framework for relational DSMS that includes DQ measurement and monitoring in a transparent, modular, and flexible way. We follow a threefold approach that takes the characteristics of relational data stream management for DQ metrics into account. While (1) Query Metrics respect changes in data quality due to query operations, (2) Content Metrics allow the semantic evaluation of data in the streams. Finally, (3) Application Metrics allow easy user-defined computation of data quality values to account for application specifics. Additionally, a quality monitor allows us to observe data quality values and take counteractions to balance data quality and performance. The framework has been designed along a DQ management methodology suited for data streams. It has been evaluated in the domains of transportation systems and health monitoring.
Year
DOI
Venue
2016
10.1145/2968332
J. Data and Information Quality
Keywords
Field
DocType
Data streams,ontologies,data quality assessment,data quality control
Ontology (information science),Ontology,Data mining,Data stream mining,Data processing,Data quality,Relational database,Computer science,Data stream,Management system,Database
Journal
Volume
Issue
ISSN
7
4
1936-1955
Citations 
PageRank 
References 
3
0.41
23
Authors
4
Name
Order
Citations
PageRank
Sandra Geisler131.43
Christoph Quix265165.79
Sven Weber330.41
Matthias Jarke450711762.03