Title
Adoption of the Linked Data Best Practices in Different Topical Domains
Abstract
The central idea of Linked Data is that data publishers support applications in discovering and integrating data by complying to a set of best practices in the areas of linking, vocabulary usage, and metadata provision. In 2011, the State of the LOD Cloud report analyzed the adoption of these best practices by linked datasets within different topical domains. The report was based on information that was provided by the dataset publishers themselves via the datahub.io Linked Data catalog. In this paper, we revisit and update the findings of the 2011 State of the LOD Cloud report based on a crawl of the Web of Linked Data conducted in April 2014. We analyze how the adoption of the different best practices has changed and present an overview of the linkage relationships between datasets in the form of an updated LOD cloud diagram, this time not based on information from dataset providers, but on data that can actually be retrieved by a Linked Data crawler. Among others, we find that the number of linked datasets has approximately doubled between 2011 and 2014, that there is increased agreement on common vocabularies for describing certain types of entities, and that provenance and license metadata is still rarely provided by the data sources.
Year
DOI
Venue
2014
10.1007/978-3-319-11964-9_16
Semantic Web Conference
Keywords
Field
DocType
best practices,linked open data,web of linked data
Metadata,Data mining,World Wide Web,Best practice,Computer science,Linked data,Vocabulary,Web crawler,Database,License,Cloud computing
Conference
Volume
ISSN
Citations 
8796
0302-9743
154
PageRank 
References 
Authors
4.75
6
3
Search Limit
100154
Name
Order
Citations
PageRank
Max Schmachtenberg11615.23
Christian Bizer28448524.93
Heiko Paulheim3109584.19