Title
Unsupervised Ranked Cross-Lingual Lexical Substitution for Low-Resource Languages.
Abstract
We propose an unsupervised system for a variant of cross-lingual lexical substitution (CLLS) to be used in a reading scenario in computer-assisted language learning (CALL), in which single-word translations provided by a dictionary are ranked according to their appropriateness in context. In contrast to most alternative systems, ours does not rely on either parallel corpora or machine translation systems, making it suitable for low-resource languages as the language to be learned. This is achieved by a graph-based scoring mechanism which can deal with ambiguous translations of context words provided by a dictionary. Due to this decoupling from the source language, we need monolingual corpus resources only for the target language, i.e. the language of the translation candidates. We evaluate our approach for the language pair Norwegian Nynorsk-English on an exploratory manually annotated gold standard and report promising results. When running our system on the original SemEval CLLS task, we rank 6th out of 18 (including 2 baselines and our 2 system variants) in the best evaluation.
Year
Venue
Keywords
2016
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
cross-lingual lexical substitution,low-resource languages,computer-assisted language learning
Field
DocType
Citations 
Cross lingual,Ranking,Computer science,Speech recognition,Natural language processing,Artificial intelligence
Conference
0
PageRank 
References 
Authors
0.34
0
3
Name
Order
Citations
PageRank
Stefan Ecker100.34
Andrea Horbach2227.23
Stefan Thater375638.54