Title | ||
---|---|---|
Information Theoretic Expectation Maximization Based Gaussian Mixture Modeling for Speaker Verification |
Abstract | ||
---|---|---|
The expectation maximization (EM) algorithm is widely used in the Gaussian mixture model (GMM) as the state-of-art statistical modeling technique. Like the classical EM method, the proposed EM-Information Theoretic algorithm (EM-IT) adapts means, covariances and weights, however this process is not conducted directly on feature vectors but on a smaller set of centroids derived by the information theoretic procedure, which simultaneously minimizes the divergence between the Parzen estimates of the feature vector’s distribution within a given Gaussian component and the centroid’s distribution within the same Gaussian component. The EM-IT algorithm was applied to the speaker verification problem using NIST 2004 speech corpus and the MFCC with dynamic features. The results showed an improvement of the equal error rate (ERR) by 1.5% over the classical EM approach. The EM-IT also showed higher convergence rates compare to the EM method. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/ICPR.2010.1102 | Pattern Recognition |
Keywords | Field | DocType |
Gaussian processes,covariance analysis,expectation-maximisation algorithm,information theory,speaker recognition,Gaussian mixture modeling,Parzen estimates,centroids,covariances,information theoretic expectation maximization,speaker verification,statistical modeling technique | Feature vector,Pattern recognition,Computer science,Expectation–maximization algorithm,Gaussian,Speaker recognition,Vector quantization,Statistical model,Gaussian process,Artificial intelligence,Mixture model | Conference |
ISSN | ISBN | Citations |
1051-4651 | 978-1-4244-7542-1 | 0 |
PageRank | References | Authors |
0.34 | 12 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sheeraz Memon | 1 | 21 | 2.88 |
Margaret Lech | 2 | 239 | 24.84 |
Namunu C. Maddage | 3 | 345 | 26.51 |