Abstract | ||
---|---|---|
Maximum a posterior (MAP) adaptation is one of the popular and powerful methods for obtaining a speaker-specific acoustic model. Basically, MAP adaptation needs a data storage for speaker adaptive (SA) model as much as speaker independent (SI) model needs. Modern speech recognition systems have a huge number of parameters and deal with millions of users. To reduce the data storage for SA models, in this paper, we propose a constrained maximum likelihood estimation-based speaker adaptation with L1 regularization. By the proposed method, we can more efficiently perform the model adjustments for SA models without almost any loss of phone recognition performance than the conventional sparse MAP adaptation method. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/ICASSP.2014.6854830 | ICASSP |
Keywords | Field | DocType |
speaker adaptation,euclidean projection on l1 ball,maximum likelihood estimation,maximum a posterior adaptation,l1 regularization,speech recognition systems,speaker recognition,constrained mle,constrained mle based speaker adaptation | Map adaptation,Pattern recognition,Computer science,Computer data storage,Maximum likelihood,Speech recognition,Regularization (mathematics),Phone,Artificial intelligence,Speaker adaptation,Acoustic model | Conference |
ISSN | Citations | PageRank |
1520-6149 | 1 | 0.35 |
References | Authors | |
7 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Younggwan Kim | 1 | 17 | 6.11 |
Hoirin Kim | 2 | 4 | 2.41 |