Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features. - Citegraph

Paper Info

Title
Improving BLSTM RNN based Mandarin speech recognition using accent dependent bottleneck features.

Abstract
This paper proposes an approach to perform accent adaptation by using accent dependent bottleneck (BN) features to improve the performance of multi-accent Mandarin speech recognition system. The architecture of the adaptation uses two neural networks. First, deep neural network (DNN) acoustic model acts as a feature extractor which is used to extract accent dependent BN (BN-DNN) features. The input features of the BN-DNN model are MFCC features appended with i-vectors features. Second, bidirectional long short term memory (BLSTM) recurrent neural network (RNN) based acoustic model is used to perform accent-specific adaptation. The input features of the BLSTM RNN model are accent dependent BN features appended with MFCC features. Experiments on RASC863 and CASIA regional accent speech corpus show that the proposed method obtains obvious improvement compared with the BLSTM RNN baseline model.

Year	Venue	Field
2016	Asia-Pacific Signal and Information Processing Association Annual Summit and Conference	Speech corpus,Mel-frequency cepstrum,Bottleneck,Pattern recognition,Computer science,Recurrent neural network,Speech recognition,Feature extraction,Artificial intelligence,Artificial neural network,Mandarin speech recognition,Acoustic model
DocType	ISSN	Citations
Conference	2309-9402	0
PageRank	References	Authors
0.34	0	4

Authors (4 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Jiangyan Yi	1	19	17.99
Hao Ni	2	0	0.68
Zhengqi Wen	3	86	24.41
Jianhua Tao	4	848	138.00

1