Abstract | ||
---|---|---|
Language modeling for large-vocabulary conversational Arabic speech recognition is faced with the problem of the complex morphology of Arabic, which increases the perplexity and out-of-vocabulary rate. This problem is compounded by the enormous dialectal variability and differences between spoken and written language. In this paper, we investigate improvements in Arabic language modeling by developing various morphology-based language models. We present four different approaches to morphology-based language modeling, including a novel technique called factored language models. Experimental results are presented for both rescoring and first-pass recognition experiments. |
Year | DOI | Venue |
---|---|---|
2006 | 10.1016/j.csl.2005.10.001 | Computer Speech & Language |
Keywords | Field | DocType |
morphology,arabic,speech recognition,language modeling,arabic language,language model | Perplexity,Computer science,Computational linguistics,Written language,Speech recognition,Natural language,Universal Networking Language,Artificial intelligence,Natural language processing,Language identification,Language model,Language technology | Journal |
Volume | Issue | ISSN |
20 | 4 | 0885-2308 |
Citations | PageRank | References |
47 | 1.73 | 25 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Katrin Kirchhoff | 1 | 1026 | 95.24 |
Dimitra Vergyri | 2 | 373 | 36.97 |
Jeff Bilmes | 3 | 3420 | 289.94 |
Kevin Duh | 4 | 819 | 72.94 |
Andreas Stolcke | 5 | 6690 | 712.46 |