Abstract | ||
---|---|---|
In this paper, we present results of non-uniform vowel normalization and show that the frequency warping necessary to do non-uniform vowel normalization is similar to the mel scale. We compare our methods to Fant's non-uniform vowel normalization method and show that with the proposed frequency-warping approach we can achieve similar performance without any knowledge of the spoken vowel and the formant number. The proposed approach is motivated by a desire to perform non-uniform speaker normalization in automatic speech recognition systems. We also present results of a more comprehensive study of our earlier work on non-uniform scaling, which again shows that the mel scale is the appropriate warping function. All the results in this paper are based on data from the Peterson & Barney and Hillenbrand et al. vowel databases. |
Year | DOI | Venue |
---|---|---|
2002 | 10.1109/ICASSP.2002.5743768 | 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) |
Keywords | Field | DocType |
bismuth,automatic speech recognition,frequency domain analysis,length measurement,gold,databases | Frequency domain,Image warping,Normalization (statistics),Pattern recognition,Computer science,Length measurement,Speech recognition,Vowel,Artificial intelligence,Scaling | Conference |
Volume | ISSN | ISBN |
1 | 1520-6149 | 0-7803-7402-9 |
Citations | PageRank | References |
3 | 0.48 | 1 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
S. Umesh | 1 | 141 | 15.66 |
S. V. Bharath Kumar | 2 | 13 | 3.38 |
M. K. Vinay | 3 | 3 | 0.48 |
Rajesh Sharma | 4 | 3 | 0.48 |
Rohit Sinha | 5 | 231 | 30.54 |