Abstract | ||
---|---|---|
This paper presents the results of a longitudinal study of ASR performance on ageing voices. Experiments were conducted on the audio recordings of the proceedings of the Supreme Court Of The United States (SCOTUS). Results show that the Automatic Speech Recognition (ASR) Word Error Rates (WERs) for elderly voices are significantly higher than those of adult voices. The word error rate increases gradually as the age of the elderly speakers increase. Use of maximum likelihood linear regression (MLLR) based speaker adaptation on ageing voices improves the WER though the performance is still considerably lower compared to adult voices. Speaker adaptation however reduces the increase in WER with age during old age. |
Year | Venue | Keywords |
---|---|---|
2008 | INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | Ageing Voices, longitudinal study, SCOTUS corpus, MLLR |
Field | DocType | Citations |
Longitudinal study,Ageing,Supreme court,Computer science,Word error rate,Speech recognition,Maximum likelihood linear regression,Speaker adaptation | Conference | 8 |
PageRank | References | Authors |
0.77 | 3 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ravichander Vipperla | 1 | 63 | 6.16 |
Steve Renals | 2 | 2570 | 293.02 |
Joe Frankel | 3 | 312 | 22.78 |