Title | ||
---|---|---|
Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions. |
Abstract | ||
---|---|---|
We investigate induction of a bilingual lexicon from a corpus of phonemic transcriptions that have been sentence-aligned with English translations. We evaluate existing models that have been used for this purpose and report on two additional models, which demonstrate performance improvements. The first performs monolingual segmentation followed by alignment, while the second performs both tasks jointly. We show that monolingual and bilingual lexical entries can be learnt with high precision from corpora having just 1k–10k sentences. We explain how our results support the application of alignment algorithms to the task of documenting endangered languages. |
Year | Venue | DocType |
---|---|---|
2015 | IWSLT | Conference |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Oliver Adams | 1 | 8 | 1.84 |
Graham Neubig | 2 | 989 | 130.31 |
Trevor Cohn | 3 | 1649 | 110.69 |
Steven Bird | 4 | 173 | 64.42 |