Title | ||
---|---|---|
The Cambridge University 2014 Bolt Conversational Telephone Mandarin Chinese Lvcsr System For Speech Translation |
Abstract | ||
---|---|---|
This paper presents the development of the 2014 Cambridge University conversational telephone Mandarin Chinese LVCSR system for the DARPA BOLT speech translation evaluation. A range of advanced modelling techniques were employed to both improve the recognition performance and provide a suitable integration with the translation system. These include an improved system combination technique using frame level acoustic model combination via joint decoding. Sequence trained deep neural network (DNN) based hybrid and tandem systems were combined on-the-fly to produce a consistent decoding output during search. A multi-level paraphrastic recurrent neural network LM (RNNLM) modelling both alternative paraphrase expressions and character sequences while preserving a consistent character to word segmentation was also used. This system gave an overall character error rate (CER) of 29.1% on the BOLT dev14 development set. |
Year | Venue | Keywords |
---|---|---|
2015 | 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | conversational speech transcription, speech translation, system combination, RNNLM, character LM |
Field | DocType | Citations |
Expression (mathematics),Computer science,Word error rate,Recurrent neural network,Text segmentation,Speech recognition,Natural language processing,Artificial intelligence,Speech translation,Artificial neural network,Mandarin Chinese,Acoustic model | Conference | 5 |
PageRank | References | Authors |
0.42 | 21 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xunying Liu | 1 | 330 | 52.46 |
Federico Flego | 2 | 55 | 6.19 |
Linlin Wang | 3 | 26 | 3.91 |
Chao Zhang | 4 | 95 | 9.70 |
Mark J. F. Gales | 5 | 3905 | 367.45 |
Philip C. Woodland | 6 | 4097 | 488.66 |