Title
A Provenance Approach to Trace Scientific Experiments on a Grid Infrastructure
Abstract
Large experiments on distributed infrastructures become increasingly complex to manage, in particular to trace all computations that gave origin to a piece of data or an event such as an error. The work presented in this paper describes the design and implementation of an architecture to support experiment provenance and its deployment in the concrete case of a particular e-infrastructure for biosciences. The proposed solution consists of: (a) a data provenance repository to capture scientific experiments and their execution path, (b) a software tool (crawler) that gathers, classifies, links, and stores the information collected from various sources, and (c) a set of user interfaces through which the end-user can access the provenance data, interpret the results, and trace the sources of failure. The approach is based on an OPM-compliant API, PLIER, that is flexible to support future extensions and facilitates interoperability among heterogeneous application systems.
Year
DOI
Venue
2011
10.1109/eScience.2011.27
eScience
Keywords
Field
DocType
provenance data,provenance approach,particular e-infrastructure,future extension,data provenance repository,execution path,facilitates interoperability,grid infrastructure,trace scientific experiments,experiment provenance,heterogeneous application system,concrete case,opm-compliant api,data management,user interfaces,distributed databases,grid computing,knowledge based system,biomedical imaging,knowledge based systems,data interpretation,bioscience,distributed system,computer architecture,distributed systems,distributed database,concrete,metadata,user interface
Metadata,Grid computing,Computer science,Interoperability,e-Science,Distributed database,User interface,Data management,Database,Grid
Conference
ISBN
Citations 
PageRank 
978-1-4577-2163-2
8
0.58
References 
Authors
13
6