Title
Progress On Mandarin Conversational Telephone Speech Recognition
Abstract
Over the past decade, there has been good progress on English conversational telephone speech (CTS) recognition, built on the Switchboard and Fisher corpora. In this paper, we present our efforts on extending language-independent technologies into Mandarin CTS, as well as addressing language-dependent issues such as tone. We will show the impact of each of the following factors: (a) simplified Mandarin phone set, (b) pitch features, (c) auto-retrieved web texts for augmenting n-gram training, (d) speaker adaptive training, (e) maximum mutual information estimation, (f) decision-tree-based parameter sharing, (g) cross-word co-articulation modeling, and (h) combining MFCC and PLP decoding outputs using confusion networks. We have reduced the Chinese character error rate (CER) of the BBN-2003 development test set from 53.8% to 46.8% after (a)+(b)+(c)+(f)+(g) are combined. Further reduction in CER is anticipated after integrating all improvements.
Year
DOI
Venue
2004
10.1109/CHINSL.2004.1409571
2004 International Symposium on Chinese Spoken Language Processing, Proceedings
Keywords
Field
DocType
feature extraction,decision tree,parameter estimation,speaker recognition,speech processing,decision trees
Mel-frequency cepstrum,Speech processing,Pattern recognition,Computer science,Word error rate,Speech recognition,Speaker recognition,Artificial intelligence,Mutual information,Decoding methods,Mandarin Chinese,Test set
Conference
Citations 
PageRank 
References 
4
0.64
11
Authors
13
Name
Order
Citations
PageRank
Mei-Yuh Hwang1477124.33
Xin Lei252.01
Tim Ng31229.38
Ivan Bulyko424922.40
Mari Ostendorf52462348.75
Andreas Stolcke66690712.46
Wen Wang7344.08
Jing Zheng844243.00
Venkata Ramana940.64
Venkata Ramana Rao Gadde1018815.83
Martin Graciarena1128124.70
Manhung Siu1246461.40
Yan Huang1340.64