Title
Towards integrating workflow and database provenance
Abstract
While there has been substantial work on both database and workflow provenance, the two problems have only been examined in isolation. It is widely accepted that the existing models are incompatible. Database provenance is fine-grained and captures changes to tuples in a database. In contrast, workflow provenance is represented at a coarser level and reflects the functional model of workflow systems, which is stateless--each computational step derives a new artifact. In this paper, we propose a new approach to combine database and workflow provenance. We address the mismatch between the different kinds of provenance by using a temporal model which explicitly represents the database states as updates are applied. We discuss how, under this model, reproducibility is obtained for workflows that manipulate databases, and how different queries that straddle the two provenance traces can be evaluated. We also describe a proof-of-concept implementation that integrates a workflow system and a commercial relational database.
Year
DOI
Venue
2012
10.1007/978-3-642-34222-6_2
IPAW
Keywords
Field
DocType
workflow provenance,provenance trace,functional model,workflow system,temporal model,existing model,database provenance,different kind,commercial relational database,database state
Data mining,Relational database,Tuple,Computer science,Provenance,Straddle,Workflow engine,Workflow,Database
Conference
Volume
ISSN
Citations 
7525
0302-9743
6
PageRank 
References 
Authors
0.50
10
2
Name
Order
Citations
PageRank
Fernando Seabra Chirigati120516.38
Juliana Freire23956270.89