Title
A logarithmic based pole-zero vocal tract model estimation for speaker verification
Abstract
In this paper, we investigate the use of formant and anti-formant measurements of nasal consonants for speaker verification. The features are obtained from a pole-zero vocal tract model estimate that is optimized by minimizing a logarithmic criterion, motivated by how the human auditory system perceives amplitude. A GMM-UBM approach is used to perform speaker comparisons within the likelihood-ratio framework. Results are compared with systems based on Mel Frequency Cepstral Coefficients (MFCCs) and with a system based on formant center frequencies and bandwidths obtained using the Snack Toolkit. The formant and anti-formant based system attains results comparable to the MFCC system and outperforms the formant-based approach, while offering a more straightforward interpretation in terms of a physical speech production model.
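As a rough illustration of the idea in the abstract, the sketch below evaluates a pole-zero spectrum built from formants (poles) and anti-formants (zeros) and scores it against a target with a mean squared log-magnitude error. This is only one plausible reading of "a logarithmic criterion"; the paper's exact criterion and optimizer are not given here. The function names, the second-order section parameterization, the sampling rate, and the example frequencies and bandwidths are all assumptions made for illustration.

```python
import cmath
import math


def section(freq_hz, bw_hz, fs):
    """Coefficients (1, c1, c2) of 1 + c1*z^-1 + c2*z^-2 for one resonance.

    Assumed parameterization: root radius from bandwidth, root angle from
    center frequency (a common textbook mapping, not taken from the paper).
    """
    r = math.exp(-math.pi * bw_hz / fs)
    theta = 2.0 * math.pi * freq_hz / fs
    return (1.0, -2.0 * r * math.cos(theta), r * r)


def log_magnitude(formants, antiformants, freq_hz, fs):
    """Natural-log magnitude of the pole-zero model at one frequency."""
    z_inv = cmath.exp(-2j * math.pi * freq_hz / fs)

    def log_abs(c):
        return math.log(abs(c[0] + c[1] * z_inv + c[2] * z_inv ** 2))

    zeros_part = sum(log_abs(section(f, b, fs)) for f, b in antiformants)
    poles_part = sum(log_abs(section(f, b, fs)) for f, b in formants)
    return zeros_part - poles_part


def log_spectral_error(target, formants, antiformants, grid_hz, fs):
    """Mean squared difference between target and model log magnitudes."""
    diffs = [
        t - log_magnitude(formants, antiformants, f, fs)
        for t, f in zip(target, grid_hz)
    ]
    return sum(d * d for d in diffs) / len(diffs)


# Hypothetical nasal-like example: two formants and one anti-formant.
FS = 8000.0
GRID_HZ = [100.0 * k for k in range(1, 40)]
TRUE_FORMANTS = [(250.0, 80.0), (2200.0, 200.0)]
TRUE_ANTIFORMANTS = [(750.0, 100.0)]
TARGET = [log_magnitude(TRUE_FORMANTS, TRUE_ANTIFORMANTS, f, FS) for f in GRID_HZ]

# Error for the generating parameters and for a shifted anti-formant guess.
err_true = log_spectral_error(TARGET, TRUE_FORMANTS, TRUE_ANTIFORMANTS, GRID_HZ, FS)
err_shifted = log_spectral_error(TARGET, TRUE_FORMANTS, [(900.0, 100.0)], GRID_HZ, FS)
```

In an actual estimator, an optimizer would adjust the formant and anti-formant frequencies and bandwidths to reduce this error against the log spectrum of a measured nasal segment; the fitted values would then serve as the speaker features.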
Year: 2011
DOI: 10.1109/ICASSP.2011.5947434
Venue: ICASSP
Keywords: speaker recognition,snack toolkit,formant-based approach,nasal consonants,gmm-ubm approach,likelihood-ratio framework,formants,pole-zero model,logarithmic based pole-zero vocal tract model estimation,straightforward interpretation,speech analysis,speech synthesis,anti-formants,human auditory system,physical speech production model,mel frequency cepstral coefficients,mfcc system,speaker verification,speech,vocal tract,computer model,forensics,likelihood ratio,computational modeling,feature extraction,mel frequency cepstral coefficient,production,speech production
Field: Mel-frequency cepstrum,Speech synthesis,Pattern recognition,Computer science,Feature extraction,Speech recognition,Speaker recognition,Artificial intelligence,Logarithm,Formant,Speech production,Vocal tract
DocType: Conference
ISSN: 1520-6149
E-ISBN: 978-1-4577-0537-3
ISBN: 978-1-4577-0537-3
Citations: 3
PageRank: 0.40
References: 6
Authors: 4
Name, Order, Citations, PageRank
Ewald Enzinger, 1, 9, 3.44
Peter Balazs, 2, 125, 10.83
Damián Marelli, 3, 164, 19.58
Timo Becker, 4, 17, 1.56