Abstract | ||
---|---|---|
A phonetic typewriter that utilizes the underlying statistical structure of phoneme/character sequences is described. The syllable/character trigram approach to language modeling is adopted to make language source models. These are obtained by calculating trigram probabilities, using a large text database. The phonetic typewriter is tested using 279 phrases uttered by one male speaker, and the syllable source model achieves a 94.9% phoneme recognition rate with the test-set phoneme perplexity of 3.9. Without the syllable trigram, the phoneme recognition rate is only 73.2%. A trigram model based on characters is also evaluated. This model can reduce the phoneme perplexity significantly compared with that of the syllable trigram. |
Year | DOI | Venue |
---|---|---|
1991 | 10.1109/ICASSP.1991.150304 | ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference |
Keywords | DocType | ISSN |
speech recognition,typewriters,voice equipment,automatic speech recognition,character sequences,character trigram,language modeling,language source models,phoneme recognition rate,phoneme source modeling,phonetic typewriter,syllable trigram,test-set phoneme perplexity,text database,trigram model,trigram probabilities | Conference | 1520-6149 |
ISBN | Citations | PageRank |
0-7803-0003-3 | 3 | 0.53 |
References | Authors | |
4 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yamada, T. | 1 | 3 | 0.53 |
Hanazawa, T. | 2 | 5 | 0.93 |
Kawabata, T. | 3 | 37 | 4.67 |
Matsunaga, S. | 4 | 477 | 49.70 |