Title | ||
---|---|---|
Collocations in a Rule-Based MT System: A Case Study Evaluation of Their Translation Adequacy |
Abstract | ||
---|---|---|
Collocations constitute a subclass of multi-word expressions that are particu- larly problematic for machine translation, due 1) to their omnipresence in texts, and 2) to their morpho-syntactic proper- ties, allowing virtually unlimited variation and leading to long-distance dependen- cies. Since existing MT systems incorpo- rate mostly local information, these are ar- guably ill-suited for handling those collo- cations whose items are not found in close proximity. In this article, we describe an integrated environment in which colloca- tions (and possibly their translation equiv- alents) are first identified from text cor- pora and stored in the lexical database of a translation system, then they are employed by this system, which is capable of deal- ing with syntactic transformations as it is based on a deep linguistic approach. We compare the performance of our system (in terms of collocation translation adequacy) with that of two major MT systems, one statistical, and the other rule-based. Our results confirm that syntactic variation af- fects translation quality and show that a deep syntactic approach is more robust in this sense, especially for languages with freer word order (e.g., German) and richer morphology (e.g., Italian) than English. |
Year | Venue | Keywords |
---|---|---|
2009 | EAMT | machine translation,word order,rule based |
DocType | Citations | PageRank |
Conference | 4 | 0.41 |
References | Authors | |
6 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
eric wehrli | 1 | 29 | 6.20 |
violeta seretan | 2 | 92 | 11.52 |
luka nerima | 3 | 82 | 11.43 |
Lorenza Russo | 4 | 7 | 1.56 |