Title
Open-Source portuguese–spanish machine translation
Abstract
This paper describes the current status of development of an open-source shallow-transfer machine translation (MT) system for the [European] Portuguese $\leftrightarrow$ Spanish language pair, developed using the OpenTrad Apertium MT toolbox (www.apertium.org). Apertium uses finite-state transducers for lexical processing, hidden Markov models for part-of-speech tagging, and finite-state-based chunking for structural transfer, and is based on a simple rationale: to produce fast, reasonably intelligible and easily correctable translations between related languages, it suffices to use a MT strategy which uses shallow parsing techniques to refine word-for-word MT. This paper briefly describes the MT engine, the formats it uses for linguistic data, and the compilers that convert these data into an efficient format used by the engine, and then goes on to describe in more detail the pilot Portuguese$\leftrightarrow$Spanish linguistic data.
Year
DOI
Venue
2006
10.1007/11751984_6
PROPOR
Keywords
Field
DocType
opentrad apertium mt toolbox,mt strategy,paper briefly,word-for-word mt,spanish linguistic data,correctable translation,spanish language pair,spanish machine translation,pilot portuguese,linguistic data,mt engine,open-source portuguese,hidden markov model,finite state transducer,machine translation
Shallow parsing,Computer science,Machine translation,Toolbox,Portuguese,Compiler,Natural language processing,Chunking (psychology),Transfer-based machine translation,Artificial intelligence,Hidden Markov model
Conference
Volume
ISSN
ISBN
3960
0302-9743
3-540-34045-9
Citations 
PageRank 
References 
29
1.82
4
Authors
10