Title
Active knowledge: dynamically enriching RDF knowledge bases by web services
Abstract
The proliferation of knowledge-sharing communities and the advances in information extraction have enabled the construction of large knowledge bases using the RDF data model to represent entities and relationships. However, as the Web and its latently embedded facts evolve, a knowledge base can never be complete and up-to-date. On the other hand, a rapidly increasing suite of Web services provides access to timely and high-quality information, but this is encapsulated by the service interface. We propose to leverage the information that could be dynamically obtained from Web services in order to enrich RDF knowledge bases on the fly whenever the knowledge base does not suffice to answer a user query. To this end, we develop a sound framework for appropriately generating queries to encapsulated Web services and efficient algorithms for query execution and result integration. The query generator composes sequences of function calls based on the available service interfaces. As Web service calls are expensive, our method aims to reduce the number of calls in order to retrieve results with sufficient recall. Our approach is fully implemented in a complete prototype system named ANGIE. The user can query and browse the RDF knowledge base as if it already contained all the facts from the Web services. This data, however, is gathered and integrated on the fly, transparently to the user. We demonstrate the viability and efficiency of our approach in experiments based on real-life data provided by popular Web services.
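The idea of enriching a knowledge base on the fly can be illustrated with a minimal sketch. This is not the ANGIE implementation: the class `ActiveKnowledgeBase`, the function `fake_song_service`, and the `ex:` identifiers are all hypothetical names invented for illustration, and the "service" is a local stub rather than a real Web service. The sketch only shows the fallback step (consult the local triples first, call a service on a miss, integrate the result) and caches which lookups were already attempted so that no service is called twice for the same question. It does not attempt the composition of call sequences described in the abstract, and it assumes that any local hit is a sufficient answer.

```python
from typing import Callable, Dict, List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)
Service = Callable[[str], List[Triple]]  # hypothetical service signature


class ActiveKnowledgeBase:
    """Toy triple store that falls back to registered services on a miss."""

    def __init__(self) -> None:
        self.triples: Set[Triple] = set()
        self.services: Dict[str, Service] = {}
        self.asked: Set[Tuple[str, str]] = set()  # lookups already sent out
        self.calls = 0  # number of (expensive) service calls made so far

    def register(self, predicate: str, service: Service) -> None:
        """Declare that `service` can supply facts for `predicate`."""
        self.services[predicate] = service

    def _local(self, subject: str, predicate: str) -> Set[str]:
        return {o for (s, p, o) in self.triples if s == subject and p == predicate}

    def objects(self, subject: str, predicate: str) -> Set[str]:
        """Answer locally if possible; otherwise call the service once."""
        found = self._local(subject, predicate)
        key = (subject, predicate)
        if found or predicate not in self.services or key in self.asked:
            return found
        self.asked.add(key)
        self.calls += 1
        # Integrate the returned facts so later queries are answered locally.
        self.triples.update(self.services[predicate](subject))
        return self._local(subject, predicate)


def fake_song_service(singer: str) -> List[Triple]:
    """Stub standing in for a remote call; the catalog is made-up data."""
    catalog = {"ex:Singer1": ["ex:SongA", "ex:SongB"]}
    return [(singer, "ex:sang", song) for song in catalog.get(singer, [])]


kb_demo = ActiveKnowledgeBase()
kb_demo.register("ex:sang", fake_song_service)
print(sorted(kb_demo.objects("ex:Singer1", "ex:sang")))
print(kb_demo.calls)
```

From the caller's side, `objects` behaves as if the facts had always been in the store, which mirrors the transparency the abstract describes; the `calls` counter makes the cost of each miss visible.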
Year
2010
DOI
10.1145/1807167.1807212
Venue
SIGMOD Conference
Keywords
encapsulated web service, web service call, rdf knowledge base, popular web service, active knowledge, knowledge base, query execution, web service, large knowledge base, user query, query generator composes sequence, information integration, information extraction, rdf, warehousing, data model, semantics
Field
Web search query, Data mining, RDF query language, World Wide Web, Computer science, Web query classification, SPARQL, Web modeling, Social Semantic Web, Web service, Database, WS-Policy
DocType
Conference
Citations
23
PageRank
1.06
References
38
Authors
6
Name                 Order  Citations  PageRank
Nicoleta Preda       1      173        14.40
Gjergji Kasneci      2      2407       123.08
Fabian M. Suchanek   3      3900       188.75
Thomas Neumann       4      2523       156.50
Wenjun Yuan          5      49         10.21
Gerhard Weikum       6      12710      2146.01