Title
Recent Progress of Mandrain Spontaneous Speech Recognition on Mandrain Conversation Dialogue Corpus
Abstract
This paper presents a progress report on a relatively difficult ASR task on a spontaneous speech corpus - Mandarin Conversational Dialogue Corpus (MCDC). A DNN-based acoustic model is constructed based on the CLDNN structure with a large dataset that comprises two spontaneous-speech corpora and one read-speech corpus. The study uses a large text dataset formed by seven corpora to train an efficient general language model (LM). Two adapted LMs specially for spontaneous speech recognition are also constructed. Experimental results showed that the best performances of 26.3% in character error rate (CER) and 32.5% in word error rate (WER) were reached on MCDC. They represented 27.9% and 22.2% of relative CER and WER reductions as compared with the performances by the previous best HMM-based method. This confirms that the proposed method is promising in tackling on Mandarin spontaneous speech recognition.
Year
DOI
Venue
2019
10.1109/O-COCOSDA46868.2019.9041223
2019 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)
Keywords
DocType
ISSN
Spontaneous Speech Recognition,CLDNN,MCDC Corpus
Conference
2163-3479
ISBN
Citations 
PageRank 
978-1-7281-2450-6
0
0.34
References 
Authors
6
4
Name
Order
Citations
PageRank
Yu-Chih Deng100.34
Yih-Ru Wang223734.68
Sin-Horng Chen327339.86
Chen-Yu Chiang43111.55