Exposing Provenance Metadata Using Different RDF Models - Citegraph

Paper Info

Title
Exposing Provenance Metadata Using Different RDF Models

Abstract
A standard model for exposing structured provenance metadata of scientific assertions on the Semantic Web would increase interoperability, discoverability, reliability, as well as reproducibility for scientific discourse and evidence-based knowledge discovery. Several Resource Description Framework (RDF) models have been proposed to track provenance. However, provenance metadata may not only be verbose, but also significantly redundant. Therefore, an appropriate RDF provenance model should be efficient for publishing, querying, and reasoning over Linked Data. In the present work, we have collected millions of pairwise relations between chemicals, genes, and diseases from multiple data sources, and demonstrated the extent of redundancy of provenance information in the life science domain. We also evaluated the suitability of several RDF provenance models for this crowdsourced data set, including the N-ary model, the Singleton Property model, and the Nanopublication model. We examined query performance against three commonly used large RDF stores, including Virtuoso, Stardog, and Blazegraph. Our experiments demonstrate that query performance depends on both RDF store as well as the RDF provenance model.

Year	Venue	Field
2015	SWAT4LS	Data mining,RDF query language,Information retrieval,Computer science,Cwm,Stardog,Linked data,SPARQL,RDF/XML,RDF Schema,Database,RDF
DocType	Volume	Citations
Journal	abs/1509.02822	2
PageRank	References	Authors
0.38	9	8

Authors (8 rows)

Cited by (2 rows)

References (9 rows)

Name	Order	Citations	PageRank
Gang Fu	1	207	9.67
Evan E Bolton	2	827	69.52
Núria Queralt-Rosinach	3	210	13.43
Laura Inés Furlong	4	348	21.84
Vinh Nguyen	5	80	8.79
Amit P. Sheth	6	10950	1885.56
Olivier Bodenreider	7	2715	226.05
Michel Dumontier	8	898	93.35

1