Abstract | ||
---|---|---|
We present a study of aspects of discourse structure - specifically discourse devices used to organize information in a sentence - that significantly impact the quality of machine translation. Our analysis is based on manual evaluations of translations of news from Chinese and Arabic to English. We find that there is a particularly strong mismatch in the notion of what constitutes a sentence in Chinese and English, which occurs often and is associated with significant degradation in translation quality. Also related to lower translation quality is the need to employ multiple explicit discourse connectives (because, but, etc.), as well as the presence of ambiguous discourse connectives in the English translation. Furthermore, the mismatches between discourse expressions across languages significantly impact translation quality. |
Year | Venue | Field |
---|---|---|
2014 | PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2 | Rule-based machine translation,Example-based machine translation,Arabic,Expression (mathematics),Computer science,Machine translation,Natural language processing,Artificial intelligence,Dynamic and formal equivalence,Linguistics,Sentence,Discourse structure |
DocType | Volume | Citations |
Conference | P14-2 | 8 |
PageRank | References | Authors |
0.59 | 17 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Junyi Jessy Li | 1 | 49 | 15.72 |
Marine Carpuat | 2 | 587 | 51.99 |
Ani Nenkova | 3 | 1831 | 109.14 |