Low-Memory Fast On-Line Adaptation For Acoustically Mismatched Children'S Speech Recognition - Citegraph

Paper Info

Title
Low-Memory Fast On-Line Adaptation For Acoustically Mismatched Children'S Speech Recognition

Abstract
This work focuses on the issues and the challenges in acoustic adaptation in context of on-line children's speech recognition. When children's speech is decoded on adults' speech trained acoustic models, severely degraded recognition performance is noted on account of extreme acoustic mismatch. Though a number of conventional adaptation techniques are available, they are found to be undesirably latent for an on-line task. For addressing the same, in this work we have combined two low complexity fast adaptation techniques, namely acoustic model interpolation and low-rank feature projection. Two schemes for doing the same are presented in this work. In the first approach, model interpolation is done using weights estimated in unconstrained fashion. The other approach is a hybrid one in which a set mean supervectors are pre-estimated using suitable developmental data. Those are then optimally scaled using the given test data. Though the unconstrained approach results in better improvements over baseline, it has a higher complexity and memory requirements. In case of the hybrid approach, for interpolating M models, the number of parameters to be estimated and memory requirements are reduced by a factor of (M - 1).

Year	Venue	Keywords
2015	16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5	Speech recognition, acoustic mismatch, feature projection, fast adaptation
Field	DocType	Citations
Pattern recognition,Computer science,Speech recognition,Artificial intelligence	Conference	4
PageRank	References	Authors
0.40	0	2

Authors (2 rows)

Cited by (4 rows)

References (0 rows)

Name	Order	Citations	PageRank
S. Shahnawazuddin	1	64	17.34
Rohit Sinha	2	231	30.54

1