Abstract |
---|
• We propose a new data-driven filterbank for speech feature extraction in automatic speaker verification (ASV). • The proposed method uses voice frames of the speech signal to compute the frequency warping scale. • We also propose a new method for computing filterbank frequency responses using principal component analysis (PCA). • We conduct experiments on NIST SRE 2001, NIST SRE 2002, and VoxCeleb1 with different ASV systems. • The proposed data-driven cepstral features yield improved recognition performance over baseline methods. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1016/j.dsp.2020.102795 | Digital Signal Processing |
Keywords | DocType | Volume |
Mel scale, Frequency warping function, Speech-signal-based scale, Principal component analysis (PCA), NIST speaker recognition evaluation (SRE), VoxCeleb1 | Journal | 104 |
ISSN | Citations | PageRank |
1051-2004 | 1 | 0.36 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Susanta Sarangi | 1 | 1 | 0.36 |
Md. Sahidullah | 2 | 326 | 24.99 |
Goutam Saha | 3 | 255 | 23.17 |