Title
Design and evaluation of an ir-benchmark for sparql queries with fulltext conditions
Abstract
In this paper, we describe our goals in introducing a new, annotated benchmark collection, with which we aim to bridge the gap between the fundamentally different aspects that are involved in querying both structured and unstructured data. This semantically rich collection, captured in a unified XML format, combines components (unstructured text, semistructured infoboxes, and category structure) from 3.1 Million Wikipedia articles with highly structured RDF properties from both DBpedia and YAGO2. The new collection serves as the basis of the INEX 2012 Ad-hoc, Faceted Search, and Jeopardy retrieval tasks. With a focus on the new Jeopardy task, we particularly motivate the usage of the collection for question-answering (QA) style retrieval settings, which we also exemplify by introducing a set of 90 QA-style benchmark queries which come shipped in a SPARQL-based query format that has been extended by fulltext filter conditions.
Year
DOI
Venue
2012
10.1145/2390148.2390154
ESAIR
Keywords
Field
DocType
annotated benchmark collection,style retrieval setting,unified xml format,unstructured data,fulltext condition,sparql query,qa-style benchmark query,jeopardy retrieval task,semantically rich collection,new collection,sparql-based query format,new jeopardy task,rdf,linked data
World Wide Web,XML,Information retrieval,Faceted search,Computer science,Linked data,Unstructured data,SPARQL,RDF
Conference
Citations 
PageRank 
References 
4
0.42
4
Authors
3
Name
Order
Citations
PageRank
Arunav Mishra1455.73
Sairam Gurajada21187.83
Martin Theobald3147472.06