Abstract | ||
---|---|---|
While there has been substantial work on both database and workflow provenance, the two problems have only been examined in isolation. It is widely accepted that the existing models are incompatible. Database provenance is fine-grained and captures changes to tuples in a database. In contrast, workflow provenance is represented at a coarser level and reflects the functional model of workflow systems, which is stateless--each computational step derives a new artifact. In this paper, we propose a new approach to combine database and workflow provenance. We address the mismatch between the different kinds of provenance by using a temporal model which explicitly represents the database states as updates are applied. We discuss how, under this model, reproducibility is obtained for workflows that manipulate databases, and how different queries that straddle the two provenance traces can be evaluated. We also describe a proof-of-concept implementation that integrates a workflow system and a commercial relational database. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1007/978-3-642-34222-6_2 | IPAW |
Keywords | Field | DocType |
workflow provenance,provenance trace,functional model,workflow system,temporal model,existing model,database provenance,different kind,commercial relational database,database state | Data mining,Relational database,Tuple,Computer science,Provenance,Straddle,Workflow engine,Workflow,Database | Conference |
Volume | ISSN | Citations |
7525 | 0302-9743 | 6 |
PageRank | References | Authors |
0.50 | 10 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fernando Seabra Chirigati | 1 | 205 | 16.38 |
Juliana Freire | 2 | 3956 | 270.89 |