Title
Speaker recognition utilizing distributed DCT-II based Mel frequency cepstral coefficients and fuzzy vector quantization
Abstract
This paper presents a novel Automatic Speaker Recognition (ASR) system. The system introduces new feature extraction and vector classification steps that use distributed Discrete Cosine Transform (DCT-II) based Mel Frequency Cepstral Coefficients (MFCC) and Fuzzy Vector Quantization (FVQ). The ASR algorithm uses an MFCC-based approach to identify dynamic features for Speaker Recognition (SR). A series of experiments was performed with three feature extraction methods: (1) conventional MFCC; (2) Delta-Delta MFCC (DDMFCC); and (3) DCT-II based DDMFCC. The experiments were then expanded to include four classifiers: (1) FVQ; (2) K-means Vector Quantization (VQ); (3) Linde–Buzo–Gray (LBG) VQ; and (4) Gaussian Mixture Model (GMM). Among the VQ based classifiers, the combination of DCT-II based MFCC, Delta MFCC (DMFCC) and DDMFCC with FVQ achieved the lowest Equal Error Rate. These results improved on previously reported non-GMM methods and approached those of the computationally expensive GMM based method. Speaker verification tests highlighted the overall performance improvement of the new ASR system. Speaker source data for the experiments came from the National Institute of Standards and Technology Speaker Recognition Evaluation corpora.
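The building blocks named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how the DCT-II is "distributed", which delta window is used, or how the FVQ memberships are defined, so the code below assumes a plain unnormalised DCT-II over per-frame log Mel energies, a simple central difference for the delta features, and the standard fuzzy c-means membership formula. The function names `dct_ii`, `deltas` and `fuzzy_memberships`, the fuzzifier `m`, and the toy input values are all hypothetical.

```python
import math


def dct_ii(x):
    """Unnormalised DCT-II: X[k] = sum_n x[n] * cos(pi / N * (n + 0.5) * k)."""
    n_pts = len(x)
    return [
        sum(x[n] * math.cos(math.pi / n_pts * (n + 0.5) * k) for n in range(n_pts))
        for k in range(n_pts)
    ]


def deltas(frames):
    """Central difference between neighbouring frames (edge frames repeated).

    Assumption: a +/-1 frame window; the paper's window is not given in the abstract.
    """
    last = len(frames) - 1
    out = []
    for t in range(len(frames)):
        prev, nxt = frames[max(t - 1, 0)], frames[min(t + 1, last)]
        out.append([(b - a) / 2.0 for a, b in zip(prev, nxt)])
    return out


def fuzzy_memberships(vec, codebook, m=2.0):
    """Fuzzy c-means style membership of one vector in each codeword.

    u_i = 1 / sum_j (d_i / d_j) ** (2 / (m - 1)), with m > 1 the fuzzifier.
    """
    dists = [math.dist(vec, c) for c in codebook]
    zeros = dists.count(0.0)
    if zeros:  # vector coincides with one or more codewords
        return [1.0 / zeros if d == 0.0 else 0.0 for d in dists]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((di / dj) ** power for dj in dists) for di in dists]


# Toy log Mel filterbank energies: 3 frames x 4 bands (made-up numbers).
log_mel = [[1.0, 0.5, 0.2, 0.1], [1.1, 0.6, 0.1, 0.0], [0.9, 0.7, 0.3, 0.2]]
mfcc = [dct_ii(frame) for frame in log_mel]  # static cepstra
dmfcc = deltas(mfcc)                          # delta (DMFCC)
ddmfcc = deltas(dmfcc)                        # delta-delta (DDMFCC)
# Concatenate static, delta and delta-delta coefficients per frame.
features = [s + d + dd for s, d, dd in zip(mfcc, dmfcc, ddmfcc)]
```

Each row of `features` holds the static, delta and delta-delta coefficients of one frame; `fuzzy_memberships` then gives the soft assignment of such a vector to every codeword in a codebook, in contrast to the single hard assignment used by K-means or LBG VQ.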
Year
2013
DOI
10.1007/s10772-012-9166-0
Venue
I. J. Speech Technology
Keywords
Speaker recognition, Discrete cosine transform, Fuzzy vector quantization, K-Means, Linde–Buzo–Gray, Mel frequency cepstral coefficients, Speech feature extraction
Field
Mel-frequency cepstrum, k-means clustering, Pattern recognition, Computer science, Word error rate, Discrete cosine transform, Speech recognition, Feature extraction, Vector quantization, Gaussian, Speaker recognition, Artificial intelligence
DocType
Journal
Volume
16
Issue
1
ISSN
1572-8110
Citations
2
PageRank
0.42
References
10
Authors
2
Name, Order, Citations, PageRank
M. Afzal Hossan, 1, 2, 0.42
Mark A. Gregory, 2, 44, 14.36