Title | ||
---|---|---|
Speaker-independent word recognition in connected speech on the basis of Phoneme recognition |
Abstract | ||
---|---|---|
A method of speaker-independent connected-word recognition by robust segmentation for speaker variation is described. To normalize the variation by speakers, an input speech pattern is transformed through segmentation and labeling into a sequence of phonemically labeled segments (phoneme string) which have less variation by speakers. Connected word recognition is carried out using a two-level DP matching algorithm on that phoneme string. The input speech pattern is oversegmented in order to avoid omissions which cause fatal errors in word recognition. The number of segments which correspond to one phoneme should depend on the phoneme; the number of segments for vowels should be greater than that for consonants. From this viewpoint, we propose a method of varying the matching path adaptively with respect to each phoneme, at the dynamic-programming word-matching level. In experiments on spokenword recognition of one to four connected digits, the recognition rate for each word was about 90% and for each sequence of words was about 80%, on an average over seven male speakers. In the case where the words are spoken clearly, the former improved to 93.8% and the latter to 86.0% on an average. |
Year | DOI | Venue |
---|---|---|
1984 | 10.1016/0020-0255(84)90041-0 | Inf. Sci. |
Keywords | Field | DocType |
connected speech,phoneme recognition,speaker-independent word recognition,word recognition | Connected speech,Normalization (statistics),Pattern recognition,Audio mining,Segmentation,Computer science,Word error rate,Word recognition,Speech recognition,Speaker recognition,Artificial intelligence,Blossom algorithm | Journal |
Volume | Issue | ISSN |
33 | 1-2 | 0020-0255 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kiyoshi Maenobu | 1 | 61 | 22.26 |
Yasuo Ariki | 2 | 519 | 88.94 |
toshiyuki sakai | 3 | 13 | 2.75 |