Title | ||
---|---|---|
Improved i-vector extraction technique for speaker verification with short utterances. |
Abstract | ||
---|---|---|
A major challenge in automatic speaker verification (ASV) is to improve performance with short speech segments for end-user convenience in real-world applications. In this paper, we present a detailed analysis of the effects of duration variability on state-of-the-art i-vector and classical Gaussian mixture model-universal background model (GMM-UBM) based ASV systems. We observe that the uncertainty of model parameter estimation in i-vector based ASV increases as speech duration decreases. To compensate for the effect of duration variability in short utterances, we propose an adaptation technique for estimating the Baum-Welch statistics used in i-vector extraction. The adaptation uses information from pre-estimated background model parameters. The ASV performance with the proposed approach is considerably superior to that of the conventional i-vector based system. Furthermore, fusing the proposed i-vector based system with the GMM-UBM system further improves ASV performance, especially for short speech segments. Experiments conducted on two speech corpora, NIST SRE 2008 and 2010, show relative improvements in equal error rate (EER) in the range of 12–20%. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1007/s10772-017-9477-2 | International Journal of Speech Technology |
Keywords | Field | DocType |
Speaker recognition, i-Vector, GMM-UBM, Short utterance, Duration variability, Baum–Welch statistics | I vector,Speaker verification,Pattern recognition,Computer science,Word error rate,Speech recognition,Speaker recognition,NIST,Gaussian,Artificial intelligence,Model parameter | Journal |
Volume | Issue | ISSN |
21 | 3 | 1381-2416 |
Citations | PageRank | References |
1 | 0.34 | 31 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Arnab Poddar | 1 | 6 | 2.13 |
Md. Sahidullah | 2 | 326 | 24.99 |
Goutam Saha | 3 | 255 | 23.17 |