Title
Towards scalable data integration under constraints
Abstract
In this paper we consider the problem of answering queries using views, with or without ontological constraints, which is important for data integration, query optimization, and data warehouses. Our context is data integration, so we search for maximally-contained rewritings. We have produced a very scalable and efficient solution for the problem's simplest form, conjunctive queries and views, and we are working towards the full relational case. When considering constraints, the problem is usually divided into two phases: (1) query expansion, which rewrites queries w.r.t. the intensional knowledge, and (2) reformulation of the expanded query using the views. Relevant algorithms have given little attention to the second phase and have overall studied only a limited form of view definition languages (namely, only GAV). By looking at the problem from a graph perspective, we are able to gain better insight and develop designs which compactly represent common patterns in the source descriptions and (optionally) push some computation offline. This allows us to contribute significantly to both aforementioned phases individually, tailor each to the other, and moreover address them in a unified way. We intend to provide a solution that supports a variety of ontology languages and all prevalent view definition languages (G/LAV). Towards such a general and scalable system, our preliminary results for the relational case show an experimental performance about two orders of magnitude faster than current state-of-the-art algorithms, rewriting queries using over 10000 views within seconds.
Year: 2012
DOI: 10.1145/2320765.2320835
Venue: EDBT/ICDT Workshops
Keywords: efficient solution, data warehouse, query optimization, expanded query reformulation, limited form, data integration, queries w, conjunctive query, towards scalable data integration, full relational case, query expansion, conjunctive queries, data integrity, testing
Field: Query optimization, Data integration, Conjunctive query, Query language, Information retrieval, Query expansion, Computer science, Web query classification, View, Spatial query
DocType: Conference
Citations: 1
PageRank: 0.41
References: 11
Authors: 2
George Konstantinidis (Order: 1, Citations: 58, PageRank: 8.55)
José Luis Ambite (Order: 2, Citations: 958, PageRank: 110.89)