Abstract | ||
---|---|---|
We present an annotation study on a representative dataset of literal and idiomatic uses of infinitive-verb compounds in German newspaper and journal texts. Infinitive-verb compounds form a challenge for writers of German, because spelling regulations are different for literal and idiomatic uses. Through the participation of expert lexicographers we were able to obtain a high-quality corpus resource which is offered as a testbed for automatic idiomaticity detection and coarse-grained word-sense disambiguation. We trained a classifier on the corpus which was able to distinguish literal and idiomatic uses with an accuracy of 85%. |
Year | Venue | Keywords |
---|---|---|
2016 | LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | Corpus Annotation,Semantics,Idiom Detection |
Field | DocType | Citations |
Verb,Computer science,Natural language processing,Artificial intelligence,Infinitive,German | Conference | 0 |
PageRank | References | Authors |
0.34 | 0 | 9 |
Name | Order | Citations | PageRank |
---|---|---|---|
Andrea Horbach | 1 | 22 | 7.23 |
Andrea Hensler | 2 | 0 | 0.34 |
Sabine Krome | 3 | 0 | 0.34 |
Jakob Prange | 4 | 0 | 0.34 |
Werner Scholze-Stubenrecht | 5 | 0 | 0.68 |
Diana Steffen | 6 | 0 | 0.68 |
Stefan Thater | 7 | 756 | 38.54 |
Christian Wellner | 8 | 0 | 0.34 |
Manfred Pinkal | 9 | 1116 | 69.77 |