Title
Guidelines for multilingual linked data
Abstract
In this article, we argue that there is a growing number of linked datasets in different natural languages, and that there is a need for guidelines and mechanisms to ensure the quality and organic growth of this emerging multilingual data network. However, we have little knowledge regarding the actual state of this data network, its current practices, and the open challenges that it poses. Questions regarding the distribution of natural languages, the links that are established across data in different languages, or how linguistic features are represented, remain mostly unanswered. Addressing these and other language-related issues can help to identify existing problems, propose new mechanisms and guidelines or adapt the ones in use for publishing linked data including language-related features, and, ultimately, provide metrics to evaluate quality aspects. In this article we review, discuss, and extend current guidelines for publishing linked data by focusing on those methods, techniques and tools that can help RDF publishers to cope with language barriers. Whenever possible, we will illustrate and discuss each of these guidelines, methods, and tools on the basis of practical examples that we have encountered in the publication of the datos.bne.es dataset.
Year
DOI
Venue
2013
10.1145/2479787.2479867
WIMS
Keywords
Field
DocType
current practice,multilingual data network,quality aspect,language-related feature,language-related issue,current guideline,data network,natural language,different natural language,different language,semantic web,linked data
Language barrier,Data science,Data mining,Computer science,Semantic Web,Linked data,Natural language,Publishing,Organic growth,RDF
Conference
Citations 
PageRank 
References 
11
0.55
23
Authors
5
Name
Order
Citations
PageRank
Asunción Gómez-Pérez12038201.05
Daniel Vila-Suero2544.91
Elena Montiel-Ponsoda328823.18
Jorge Gracia448838.46
Guadalupe Aguado de Cea515516.08