Abstract | ||
---|---|---|
In this paper we investigate a technique to find out vocal source based features from the LP residual of speech signal for automatic speaker identification. Autocorrelation with some specific lag is computed for the residual signal to derive these features. Compared to traditional features like MFCC, PLPCC which represent vocal tract information, these features represent complementary vocal cord information. Our experiment in fusing these two sources of information in representing speaker characteristics yield better speaker identification accuracy. We have used Gaussian mixture model (GMM) based speaker modeling and results are shown on two public databases to validate our proposition. |
Year | Venue | Keywords |
---|---|---|
2011 | Clinical Orthopaedics and Related Research | gaussian mixture model,vocal tract,human computer interaction |
Field | DocType | Volume |
Residual,Mel-frequency cepstrum,Speaker identification,Pattern recognition,Computer science,Speech recognition,Speaker recognition,Speaker diarisation,Artificial intelligence,Mixture model,Vocal tract,Autocorrelation | Journal | abs/1105.2 |
ISSN | Citations | PageRank |
International Journal of Communication Engineering Applications,
Volume: 1, Page: 5-11, Year: 2011, Month: February | 0 | 0.34 |
References | Authors | |
6 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Md. Sahidullah | 1 | 326 | 24.99 |
Goutam Saha | 2 | 11 | 2.21 |