Abstract | ||
---|---|---|
All of the previous syllable based Automatic Speech Recognizers (ASRs) for the Amharic language are built by training a separate acoustic model for each of the 196 distinctly pronounced Consonant-Vowel (CV) syllable. In this paper, we will demonstrate that a smaller number of acoustic models are sufficient to build a syllable based, speaker independent, continuous, Amharic ASR. It is built for weather forecast and business report applications using the UASR (Unified Approach to Speech Synthesis and Recognition) Tool kit. A new speech corpus, which is of more than 35 hours duration, is used for training. It is a collection of corpora recorded in three different environments in order to make the recognizer less sensitive to recording environment and microphone changes. The grammar is finite state transducer based and the lexical model consists of thousands of words. Though acoustic models for only 93 syllables are trained, a recognition accuracy of 93.26% is achieved on a test set that has 4,000 words collected from 10 speakers. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/EUROCON.2013.6625203 | 2013 IEEE EUROCON |
Keywords | Field | DocType |
Amharic, ASR, UASR, CV-syllable, Finite State Transducers | Speech corpus,Speech synthesis,Computer science,Speech recognition,Syllable,Natural language processing,Artificial intelligence,Amharic,VoxForge,Microphone,Test set,Acoustic model | Conference |
Citations | PageRank | References |
0 | 0.34 | 4 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yitagessu Birhanu Gebremedhin | 1 | 1 | 0.69 |
Frank Duckhorn | 2 | 9 | 3.84 |
Rüdiger Hoffmann | 3 | 105 | 26.70 |
Ivan Kraljevski | 4 | 7 | 4.00 |