Bilingual aligned corpora for speech to speech translation for Spanish, English and Catalan - Citegraph

Paper Info

Title
Bilingual aligned corpora for speech to speech translation for Spanish, English and Catalan

Abstract
In the framework of the EU-funded Project LC-STAR, a set of Language Resources (LR) for all the Speech to Speech Translation components (Speech recognition, Machine Translation and Speech Synthesis) was developed. This paper deals with the development of bilingual corpora in Spanish, US English and Catalan. The corpora were obtained from spontaneous dialogues in one of these three languages which were translated to the other two languages. The paper describes the translation methodology, specific problems of translating spontaneous dialogues to be used for MT training, formats and the validation criteria.

Year	Venue	Keywords
2005	INTERSPEECH	speech synthesis,machine translation,speech recognition
Field	DocType	Citations
Rule-based machine translation,Speech corpus,Example-based machine translation,Evaluation of machine translation,Computer science,Machine translation,Machine translation software usability,Artificial intelligence,Natural language processing,Computer-assisted translation,Speech synthesis,Speech recognition,Linguistics	Conference	0
PageRank	References	Authors
0.34	0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
David Conejero	1	15	2.61
Alan Lounds	2	0	0.34
Carmen García-Mateo	3	175	34.39
Leandro Rodríguez Liñares	4	42	7.29
Raquel Mochales	5	0	0.34
Asunción Moreno	6	399	44.97

1