Abstract | ||
---|---|---|
We describe Joshua, an open source toolkit for statistical machine translation. Joshua implements all of the algorithms required for synchronous context free grammars (SCFGs): chart-parsing, n-gram language model integration, beam-and cube-pruning, and k-best extraction. The toolkit also implements suffix-array grammar extraction and minimum error rate training. It uses parallel and distributed computing techniques for scalability. We demonstrate that the toolkit achieves state of the art translation performance on the WMT09 French-English translation task. |
Year | Venue | Keywords |
---|---|---|
2009 | WMT@EACL | free grammar,open source toolkit,statistical machine translation,parsing-based machine translation,n-gram language model integration,art translation performance,k-best extraction,wmt09 french-english translation task,minimum error rate training,suffix-array grammar extraction,beam-and cube-pruning |
Field | DocType | Citations |
Context-free grammar,Programming language,Computer science,Word error rate,Machine translation,Grammar,Natural language processing,Artificial intelligence,Computer-assisted translation,Parsing,Language model,Scalability | Conference | 106 |
PageRank | References | Authors |
6.00 | 21 | 9 |
Name | Order | Citations | PageRank |
---|---|---|---|
Zhifei Li | 1 | 339 | 23.83 |
Chris Callison-Burch | 2 | 4872 | 259.75 |
chris dyer | 3 | 5438 | 232.28 |
Juri Ganitkevitch | 4 | 659 | 32.71 |
Sanjeev Khudanpur | 5 | 2155 | 202.00 |
Lane Schwartz | 6 | 209 | 18.01 |
Wren N. G. Thornton | 7 | 124 | 7.74 |
Jonathan Weese | 8 | 325 | 19.11 |
Omar Zaidan | 9 | 787 | 43.28 |