Abstract |
---|
This paper presents an early study on building a Vietnamese large-vocabulary continuous speech recognition system, concentrating on the choice of unit type and feature set. Our experiments were conducted using the HTK Toolkit and the VOV broadcast corpus. The results show that the recognizer with mixture units achieved better performance than recognizers with initial-final units or phoneme units. Among the feature sets applied to the mixture-unit recognizer, MFCC performs somewhat better than PLP, and combining MFCC with F0 features increases the accuracy of the Vietnamese recognition system. |
Year | Venue | Field |
---|---|---|
2005 | INTERSPEECH | Computer science, Speech recognition, Vietnamese, Vocabulary |
DocType | Citations | PageRank |
---|---|---|
Conference | 4 | 0.60 |
References | Authors |
---|---|
2 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Thang Tat Vu | 1 | 11 | 4.59 |
Dung Tien Nguyen | 2 | 18 | 4.33 |
Luong Chi Mai | 3 | 22 | 4.54 |
John-Paul Hosom | 4 | 231 | 23.43 |