Abstract | ||
---|---|---|
We introduce a new text technology, called Wikidition, which automatically generates large scale editions of corpora of natural language texts. Wikidition combines a wide range of text mining tools for automatically linking lexical, sentential and textual units. This includes the extraction of corpus-specific lexica down to the level of syntactic words and their grammatical categories. To this end, we introduce a novel measure of text reuse and exemplify Wikidition by means of the capitularies, that is, a corpus of Medieval Latin texts. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1515/itit-2015-0035 | IT-INFORMATION TECHNOLOGY |
Keywords | Field | DocType |
Wikidition, linkification, lexiconization, digital edition, text mining | Text mining,Digital edition,Computer science,Text corpus,Natural language processing,Artificial intelligence,Embedded system | Journal |
Volume | Issue | ISSN |
58 | 2 | 1611-2776 |
Citations | PageRank | References |
1 | 0.37 | 0 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alexander Mehler | 1 | 186 | 36.63 |
Rüdiger Gleim | 2 | 39 | 6.27 |
Tim Vor Der Brück | 3 | 28 | 8.50 |
Wahed Hemati | 4 | 1 | 1.73 |
Tolga Uslu | 5 | 1 | 0.37 |
Steffen Eger | 6 | 77 | 25.00 |