Region Dependent Transform On Mlp Features For Speech Recognition - Citegraph

Paper Info

Title
Region Dependent Transform On Mlp Features For Speech Recognition

Abstract
In this work, Region Dependent Transform (RDT) is used as a feature extraction process to combine the traditional short-term acoustic features with the features derived from Multi-Layer Perceptrons (MLP) which is trained from the long-term features. When compared to the conventional feature augmentation approach, substantial improvement is obtained. Moreover, an improved RDT training procedure in which speaker dependent transforms are take into account is proposed for feature combinination in the Speaker Adaptive Training. By incorporating the higher dimensional features output from the layer prior to the bottleneck layer into our Speech-to-Text (SIT) system using RDT, significant improvement is achieved as compared to using the conventional bottleneck features. In summary, by using the features derived from MLP with RDT, 8.2% to 11.4% relative reduction in Character Error Rate is achieved for our Mandarin STT systems.

Year	Venue	Keywords
2011	12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5	Multi-Layer Perceptrons, bottleneck features, Region Dependent Transform, discriminative training, Mandarin speech recognition
Field	DocType	Citations
Pattern recognition,Computer science,Speech recognition,Artificial intelligence	Conference	4
PageRank	References	Authors
0.51	1	4

Authors (4 rows)

Cited by (4 rows)

References (1 rows)

Name	Order	Citations	PageRank
Tim Ng	1	122	9.38
Bing Zhang	2	13	1.56
Spyridon Matsoukas	3	180	10.21
Long Nguyen	4	326	84.60

1