Title
Recoin: Relative Completeness in Wikidata.
Abstract
The collaborative knowledge base Wikidata is the central storage of Wikimedia projects, containing over 45 million data items. It acts as the hub for interlinking Wikipedia pages about a specific item in different languages, automates features such as infoboxes in Wikipedia, and is increasingly used for other applications such as data enrichment and question answering. Tracking the quality of Wikidata is an important issue for this project. In this paper we focus particularly on the completeness aspect. Several automated techniques have been adopted by Wikis to track and manage completeness, yet these techniques are generally subjective and do not provide a clear quality estimate at the level of entities. In this paper, we present an approach towards measuring Relative Completeness in Wikidata by comparison with data present for similar entities. This relative completeness approach is easily scalable with the introduction of new classes in the knowledge base, and has been implemented for all available entities in Wikidata. The results provide an intuition on the completeness of an entity comparing it with other similar entities. Here, we present our implementation approach along with a discussion on strategies and open challenges.
Year
DOI
Venue
2018
10.1145/3184558.3191641
WWW '18: The Web Conference 2018 Lyon France April, 2018
DocType
ISBN
Citations 
Conference
978-1-4503-5640-4
4
PageRank 
References 
Authors
0.42
0
3
Name
Order
Citations
PageRank
Vevake Balaraman193.52
Simon Razniewski215727.07
Werner Nutt32009395.43