Title
End-to-end keywords spotting based on connectionist temporal classification for Mandarin
Abstract
Traditional hybrid DNN-HMM based ASR system for keywords spotting which models HMM states are not flexible to optimize for a specific language. In this paper, we construct an end-to-end acoustic model based ASR for keywords spotting in Mandarin. This model is constructed by LSTM-RNN and trained with objective measure of connectionist temporal classification. The input of the network is feature sequences, and the output the probabilities of the initials and finals of Mandarin syllables. Compared with hybrid based ASR systems, the end-to-end system achieves a significant improvement of 6.32% on ATWV relatively. The best result of our system is ATWV 0.8310 on RASC863 data set. The proposed CTC based method applies to KWS in a specific language.
Year
DOI
Venue
2016
10.1109/ISCSLP.2016.7918460
2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)
Keywords
Field
DocType
keywords spotting,LSTM-RNN,connectionist temporal classification,end-to-end
Computer science,End-to-end principle,Speech recognition,Natural language processing,Artificial intelligence,Hidden Markov model,Spotting,Connectionism,Mandarin Chinese,Acoustic model
Conference
ISBN
Citations 
PageRank 
978-1-5090-4295-1
0
0.34
References 
Authors
0
7
Name
Order
Citations
PageRank
Ye Bai175.52
Jiangyan Yi21917.99
Hao Ni3213.25
Zhengqi Wen48624.41
Bin Liu519135.02
Ya Li63611.21
Jianhua Tao7848138.00