Title
ON THE STATISTICAL ESTIMATION OF STOCHASTIC FINITE-STATE TRANSDUCERS IN MACHINE TRANSLATION
Abstract
The inference of finite-state transducers from bilingual training data plays an important role in many natural-language tasks and mainly in machine translation. However, there are only a few techniques to infer such models. One of these techniques is the grammatical inference and alignments for transducer inference (GIATI) technique that has proven to be very adequate for speech translation, text-input machine translation, or computer-assisted translation. GIATI is a heuristic technique that requires segmented training data (i.e., the input sentences and the output sentences must be segmented with the restriction that the input segments and the output segments must be monotone aligned). For the purpose of obtaining segmented training data, pure statistical word-alignment models are used. This technique is revisited in this article. The main goal is to formally derive the complete GIATI technique using classical expectation-maximization statistical estimation procedure. This new approach allows us to avoid a hard dependence on heuristic "external" statistical techniques (statistical alignments and n-grams). A first set of experimental results obtained in a machine-translation task are also reported to initially validate this new version of the inference technique of finite-state transducers.
Year
DOI
Venue
2008
10.1080/08839510701853051
Applied Artificial Intelligence
Keywords
Field
DocType
grammatical inference,finite-state transducers,complete giati technique,statistical technique,stochastic finite-state transducers,inference technique,segmented training data,machine translation,heuristic technique,statistical estimation,classical expectation-maximization statistical estimation,computer-assisted translation,computer assisted translation,expectation maximization,natural language,finite state transducer
Rule-based machine translation,Example-based machine translation,Heuristic,Pattern recognition,Grammar induction,Computer science,Inference,Machine translation,Artificial intelligence,Transfer-based machine translation,Speech translation,Machine learning
Journal
Volume
Issue
ISSN
22
1-2
0883-9514
Citations 
PageRank 
References 
1
0.35
26
Authors
3
Name
Order
Citations
PageRank
Jesús Andrés-Ferrer1737.52
Francisco Casacuberta Nolla2184.98
Alfons Juan-Císcar332.17