Title
In search of the ur-Wikipedia: universality, similarity, and translation in the Wikipedia inter-language link network.
Abstract
Wikipedia has become one of the primary encyclopaedic information repositories on the World Wide Web. It started in 2001 with a single edition in the English language and has since expanded to more than 20 million articles in 283 languages. Criss-crossing between the Wikipedias is an inter-language link network, connecting the articles of one edition of Wikipedia to another. We describe characteristics of articles covered by nearly all Wikipedias and those covered by only a single language edition, we use the network to understand how we can judge the similarity between Wikipedias based on concept coverage, and we investigate the flow of translation between a selection of the larger Wikipedias. Our findings indicate that the relationships between Wikipedia editions follow Tobler's first law of geography: similarity decreases with increasing distance. The number of articles in a Wikipedia edition is found to be the strongest predictor of similarity, while language similarity also appears to have an influence. The English Wikipedia edition is by far the primary source of translations. We discuss the impact of these results for Wikipedia as well as user-generated content communities in general.
Year: 2012
DOI: 10.1145/2462932.2462959
Venue: WikiSym
Keywords: primary encyclopaedic information repository, similarity decrease, wikipedia inter-language link network, single language edition, english language, english wikipedia edition, inter-language link network, wikipedia edition, language similarity, larger wikipedias, single edition, wikipedia, first law of geography
Field: World Wide Web, English language, Computer science, Tobler's first law of geography, Universality (philosophy)
DocType: Conference
Citations: 7
PageRank: 0.53
References: 22
Authors (4)
Name                  Order  Citations  PageRank
Morten Warncke-Wang   1      84         3.81
Anuradha Uduwage      2      65         3.36
Zhenhua Dong          3      91         9.03
John Riedl            4      14948      1512.77