Abstract | ||
---|---|---|
A novel discriminative training method of Gaussian mixture model for text-independent speaker verification, Figure of Merit (FOM) training, is proposed in this paper. FOM training aims at maximizing the FOM of a ROC curve by adjusting the model parameters, rather than only approximating the underlying dis- tribution of acoustic observations of each speaker that Maximum Likelihood Estimation does. The text-independent speaker veri- fication experiments were conducted on the 1996 NIST Speaker Recognition Evaluation corpus. Compared with standard EM training method, FOM training provides significantly improved performance, e.g. the detection cost function (DCF) was reduced to 0.0286 from 0.0369 and to 0.0537 from 0.0826 in matched and mismatched conditions respectively. |
Year | DOI | Venue |
---|---|---|
2002 | 10.1109/ICASSP.2002.5743812 | ICASSP |
Field | DocType | Volume |
Speaker verification,Pattern recognition,Computer science,Maximum likelihood,Figure of merit,Speech recognition,Speaker recognition,NIST,Artificial intelligence,Artificial neural network,Discriminative model,Mixture model | Conference | 1 |
ISSN | Citations | PageRank |
1520-6149 | 5 | 0.67 |
References | Authors | |
5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiaohan Li | 1 | 5 | 1.68 |
Eric Chang | 2 | 625 | 49.79 |
Beiqian Dai | 3 | 51 | 8.56 |