Using stacked transformations for recognizing foreign accented speech - Citegraph

Paper Info

Title
Using stacked transformations for recognizing foreign accented speech

Abstract
A common problem in speech recognition for foreign accented speech is that there is not enough training data for an accent-specific or a speaker-specific recognizer. Speaker adaptation can be used to improve the accuracy of a speaker independent recognizer, but a lot of adaptation data is needed for speakers with a strong foreign accent. In this paper we propose a rather simple and successful technique of stacked transformations where the baseline models trained for native speakers are first adapted by using accent-specific data and then by another transformation using speaker-specific data. Because the accent-specific data can be collected offline, the first transformation can be more detailed and comprehensive, and the second one less detailed and fast. Experimental results are provided for speaker adaptation in English spoken by Finnish speakers. The evaluation results confirm that the stacked transformations are very helpful for fast speaker adaptation.

Year	DOI	Venue
2011	10.1109/ICASSP.2011.5947481	Acoustics, Speech and Signal Processing
Keywords	Field	DocType
speech recognition,accent-specific recognizer,fast speaker adaptation,foreign accented speech recognition,speaker- independent recognizer,speaker-specific recognizer,stacked transformation,automatic speech recognition,cmllr transformation,foreign-accent recognition,stacked transformations	Training set,Data modeling,Digital signal processing,Pattern recognition,Computer science,Speech recognition,Natural language processing,Artificial intelligence,Hidden Markov model,Speaker adaptation	Conference
ISSN	ISBN	Citations
1520-6149 E-ISBN : 978-1-4577-0537-3	978-1-4577-0537-3	1
PageRank	References	Authors
0.35	9	2

Authors (2 rows)

Cited by (1 rows)

References (9 rows)

Name	Order	Citations	PageRank
Peter Smit	1	18	5.08
Mikko Kurimo	2	908	93.37

1