A Multi-Region Deep Neural Network Model In Speech Recognition - Citegraph

Paper Info

Title
A Multi-Region Deep Neural Network Model In Speech Recognition

Abstract
This work proposes a new architecture for deep neural network training. Instead of having one cascade of fully connected hidden layers between the input features and the target output, the new architecture organizes hidden layers into several regions with each region having its own target. Regions communicate with each other during the training process by connections among intermediate hidden layers to share learned internal representations from their respective targets. They do not have to share the same input features. This paper presents the performance of acoustic models built using this architecture with speaker independent and dependent features. Experimental results are compared with not only the baseline DNN model, but also the ensemble DNN, unfolded RNN and stacked DNN. Experiments on the IARPA sponsored Babel tasks demonstrate improvements ranging from 0.8% to 2.7% absolute reduction in WER.

Year	Venue	Keywords
2015	16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5	speech recognition, deep neural network, multitask training
Field	DocType	Citations
Neocognitron,Pattern recognition,Computer science,Speech recognition,Time delay neural network,Artificial intelligence,Deep learning,Artificial neural network	Conference	0
PageRank	References	Authors
0.34	0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Jia Cui	1	94	6.26
George Saon	2	825	80.99
Bhuvana Ramabhadran	3	1779	153.83
B. Kingsbury	4	4175	335.43

1