Title
Region Dependent Transform On Mlp Features For Speech Recognition
Abstract
In this work, Region Dependent Transform (RDT) is used as a feature extraction process to combine the traditional short-term acoustic features with the features derived from Multi-Layer Perceptrons (MLP) which is trained from the long-term features. When compared to the conventional feature augmentation approach, substantial improvement is obtained. Moreover, an improved RDT training procedure in which speaker dependent transforms are take into account is proposed for feature combinination in the Speaker Adaptive Training. By incorporating the higher dimensional features output from the layer prior to the bottleneck layer into our Speech-to-Text (SIT) system using RDT, significant improvement is achieved as compared to using the conventional bottleneck features. In summary, by using the features derived from MLP with RDT, 8.2% to 11.4% relative reduction in Character Error Rate is achieved for our Mandarin STT systems.
Year
Venue
Keywords
2011
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5
Multi-Layer Perceptrons, bottleneck features, Region Dependent Transform, discriminative training, Mandarin speech recognition
Field
DocType
Citations 
Pattern recognition,Computer science,Speech recognition,Artificial intelligence
Conference
4
PageRank 
References 
Authors
0.51
1
4
Name
Order
Citations
PageRank
Tim Ng11229.38
Bing Zhang2131.56
Spyridon Matsoukas318010.21
Long Nguyen432684.60