Title
Exploring Retraining-Free Speech Recognition For Intra-Sentential Code-Switching
Abstract
Code Switching refers to the phenomenon of changing languages within a sentence or discourse, and it represents a challenge for conventional automatic speech recognition systems deployed to tackle a single target language. The code switching problem is complicated by the lack of multi-lingual training data needed to build new and ad hoc multi-lingual acoustic and language models. In this work, we present a prototype research code-switching speech recognition system that leverages existing monolingual acoustic and language models, i.e., no ad hoc training is needed. To generate high quality pronunciation of foreign language words in the native language phoneme set, we use a combination of existing acoustic phone decoders and an LSTM-based grapheme-to-phoneme model. In addition, a code-switching language model was developed by using translated word pairs to borrow statistics from the native language model. We demonstrate that our approach handles accented foreign pronunciations better than techniques based on human labeling. Our best system reduces the WER from 34.4%, obtained with a conventional monolingual speech recognition system, to 15.3% on an intrasentential code-switching task, without harming the monolingual accuracy.
Year
DOI
Venue
2019
10.1109/icassp.2019.8682478
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Keywords
Field
DocType
multilingual speech recognition, code-switching, retraining-free
Pronunciation,Data modeling,Pattern recognition,Computer science,Code-switching,Speech recognition,Phone,Artificial intelligence,Sentence,First language,Language model,Foreign language
Conference
ISSN
Citations 
PageRank 
1520-6149
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Zhen Huang110011.60
Xiaodan Zhuang201.01
Daben Liu301.01
Xiao Xiaoqiang401.01
Yuchen Zhang5287.80
Sabato Marco Siniscalchi631030.21