Title
Incremental translation using hierarchical phrase-based translation system
Abstract
Hierarchical phrase-based machine translation [1] (Hiero) is a prominent approach for Statistical Machine Translation, usually comparable to or better than conventional phrase-based systems. But Hiero typically uses the CKY decoding algorithm, which requires the entire input sentence before decoding begins, as it produces the translation in a bottom-up fashion. Left-to-right (LR) decoding [2] is a promising decoding algorithm for Hiero that produces the output translation in left-to-right order. In this paper we focus on simultaneous translation using the Hiero translation framework. In simultaneous translation, translations are generated incrementally as source language speech input is processed. We propose a novel approach for incremental translation by integrating segmentation and decoding in LR-Hiero. We compare two incremental decoding algorithms for LR-Hiero and present translation quality scores (BLEU) and the latency of generating translations for both decoders on audio lectures from the TED collection.
Year: 2014
DOI: 10.1109/SLT.2014.7078552
Venue: Spoken Language Technology Workshop
Keywords: language translation,natural language processing,speech coding,statistical analysis,BLEU,CKY decoding algorithm,Hiero translation framework,LR decoding,LR-Hiero,TED collection,audio lectures,hierarchical phrase-based machine translation system,incremental decoding algorithm,incremental translation,left-to-right decoding,segmentation,source language speech input,statistical machine translation,translation quality scores,Hierarchical Phrase-based Translation (Hiero),Incremental Decoding,Left-to-Right Decoding,Statistical Machine Translation (SMT)
Field: Rule-based machine translation,Example-based machine translation,Computer science,Machine translation,Phrase,Synchronous context-free grammar,Natural language processing,Artificial intelligence,Pattern recognition,Segmentation,Speech recognition,Decoding methods,Sentence
DocType: Conference
ISSN: 2639-5479
Citations: 0
PageRank: 0.34
References: 16
Authors: 4
Name                     Order  Citations  PageRank
Maryam Siahbani          1      29         4.53
Ramtin Mehdizadeh Seraj  2      0          0.34
Baskaran Sankaran        3      155        13.65
Anoop Sarkar             4      1017       88.82