Title | ||
---|---|---|
A comparison of segmentation methods and extended lexicon models for Arabic statistical machine translation |
Abstract | ||
---|---|---|
In this article, we investigate different methodologies of Arabic segmentation for statistical machine translation by comparing a rule-based segmenter to different statistically-based segmenters. We also present a method for segmentation that serves the needs of a real-time translation system without impairing the translation accuracy. Second, we report on extended lexicon models based on triplets that incorporate sentence-level context during the decoding process. Results are presented on different translation tasks that show improvements in both BLEU and TER scores. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1007/s10590-011-9102-0 | Machine Translation |
Keywords | Field | DocType |
segmentation method,statistical machine translation,different statistically-based segmenters,arabic statistical machine translation,different methodology,translation accuracy,arabic segmentation,different translation task,real-time translation system,ter score,extended lexicon model,decoding process,segmentation | Rule-based machine translation,Example-based machine translation,BLEU,Computer science,Evaluation of machine translation,Segmentation,Machine translation,Speech recognition,Lexicon,Artificial intelligence,Natural language processing,Decoding methods | Journal |
Volume | Issue | ISSN |
26 | 1-2 | 1573-0573 |
Citations | PageRank | References |
0 | 0.34 | 25 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sasa Hasan | 1 | 245 | 17.35 |
Saab Mansour | 2 | 87 | 9.40 |
Hermann Ney | 3 | 14178 | 1506.93 |