Title
Wikidition: Automatic Lexiconization And Linkification Of Text Corpora
Abstract
We introduce a new text technology, called Wikidition, which automatically generates large-scale editions of corpora of natural language texts. Wikidition combines a wide range of text mining tools for automatically linking lexical, sentential and textual units. This includes the extraction of corpus-specific lexica down to the level of syntactic words and their grammatical categories. To this end, we introduce a novel measure of text reuse and exemplify Wikidition by means of the capitularies, that is, a corpus of Medieval Latin texts.
Year: 2016
DOI: 10.1515/itit-2015-0035
Venue: IT-INFORMATION TECHNOLOGY
Keywords: Wikidition, linkification, lexiconization, digital edition, text mining
Field: Text mining, Digital edition, Computer science, Text corpus, Natural language processing, Artificial intelligence, Embedded system
DocType: Journal
Volume: 58
Issue: 2
ISSN: 1611-2776
Citations: 1
PageRank: 0.37
References: 0
Authors: 6
Name | Order | Citations | PageRank
Alexander Mehler | 1 | 186 | 36.63
Rüdiger Gleim | 2 | 39 | 6.27
Tim Vor Der Brück | 3 | 28 | 8.50
Wahed Hemati | 4 | 1 | 1.73
Tolga Uslu | 5 | 1 | 0.37
Steffen Eger | 6 | 77 | 25.00