Title
Acquiring a Taxonomy from the German Wikipedia
Abstract
This paper presents the process of acquiring a large, domain independent, taxonomy from the German Wikipedia. We build upon a previously implemented platform that extracts a semantic network and taxonomy from the English version of the Wikipedia. We describe two accomplishments of our work: the semantic network for the German language in which isa links are identified and annotated, and an expansion of the platform for easy adaptation for a new language. We identify the platform's strengths and shortcomings, which stem from the scarcity of free processing resources for languages other than English. We show that the taxonomy induction process is highly reliable - evaluated against the German version of WordNet, GermaNet, the resource obtained shows an accuracy of 83.34%.
Year
Venue
Keywords
2008
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008
semantic network
Field
DocType
Citations 
Scarcity,Computer science,Semantic network,Natural language processing,GermaNet,Artificial intelligence,Constructed language,WordNet,German
Conference
11
PageRank 
References 
Authors
0.61
7
3
Name
Order
Citations
PageRank
Laura Kassner1110.61
Vivi Nastase252341.30
Michael Strube32142137.32