Title
Mining the Web of Linked Data with RapidMiner
Abstract
Lots of data from different domains are published as Linked Open Data (LOD). While there are quite a few browsers for such data, as well as intelligent tools for particular purposes, a versatile tool for deriving additional knowledge by mining the Web of Linked Data is still missing. In this system paper, we introduce the RapidMiner Linked Open Data extension. The extension hooks into the powerful data mining and analysis platform RapidMiner, and offers operators for accessing Linked Open Data in RapidMiner, allowing for using it in sophisticated data analysis workflows without the need for expert knowledge in SPARQL or RDF. The extension allows for autonomously exploring the Web of Data by following links, thereby discovering relevant datasets on the fly, as well as for integrating overlapping data found in different datasets. As an example, we show how statistical data from the World Bank on scientific publications, published as an RDF data cube, can be automatically linked to further datasets and analyzed using additional background knowledge from ten different LOD datasets.
Year
DOI
Venue
2015
10.1016/j.websem.2015.06.004
Journal of Web Semantics
Keywords
Field
DocType
Linked Open Data,Data mining,RapidMiner
Data mining,World Wide Web,Information retrieval,Computer science,On the fly,Linked data,SPARQL,Workflow,RDF,Data cube
Journal
Volume
Issue
ISSN
35
P3
1570-8268
Citations 
PageRank 
References 
23
1.05
33
Authors
3
Name
Order
Citations
PageRank
petar ristoski125621.36
Christian Bizer28448524.93
Heiko Paulheim3109584.19