Title
A conceptual model and predicate language for data selection and projection based on provenance
Abstract
Writing relational database queries over current provenance databases can be complex and error-prone because application data is typically mixed with provenance data, because queries may require recursion, and because the form in which provenance is maintained requires procedural parsing not easily framed in query syntax. As a result, it is often difficult to write queries that select (rows or columns of) data based on provenance. In this paper, we contribute a conceptual model and a predicate language for use in relational algebra that allows the user to write simple, nonrecursive queries to select data and attributes based on provenance. Our model also includes novel data and provenance features, including multi-valued attributes, that are useful for data curation settings. We show that our predicate language supports a broad class of queries that select application data based on provenance. We also show how selection of data with our language extensions can be emulated with an existing graph database system and its associated query language.
Year
Venue
Keywords
2010
TaPP
provenance feature,current provenance databases,data curation setting,provenance data,novel data,language extension,data selection,conceptual model,predicate language,application data,select application data,associated query language
Field
DocType
Citations 
Row,Query language,Graph database,Programming language,Information retrieval,Relational database,Conceptual model,Computer science,Data curation,Relational algebra,Parsing
Conference
1
PageRank 
References 
Authors
0.38
14
2
Name
Order
Citations
PageRank
David W. Archer1505.28
Lois M. L. Delcambre2992420.78