Title
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization.
Abstract
Conversational bilingual speech encompasses three types of utterances: two purely monolingual types and one intra-sententially code-switched type. In this work, we propose a general framework to jointly model the likelihoods of the monolingual and code-switch sub-tasks that comprise bilingual speech recognition. By defining the monolingual sub-tasks with label-to-frame synchronization, our joint modeling framework can be conditionally factorized such that the final bilingual output, which may or may not be code-switched, is obtained given only monolingual information. We show that this conditionally factorized joint framework can be modeled by an end-to-end differentiable neural network. We demonstrate the efficacy of our proposed model on bilingual Mandarin-English speech recognition across both monolingual and code-switched corpora.
Year
DOI
Venue
2022
10.1109/ICASSP43922.2022.9747537
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
9
Name
Order
Citations
PageRank
Brian Yan102.37
Chunlei Zhang2377.43
Meng Yu352466.52
Shi-Xiong Zhang4186.75
Siddharth Dalmia500.34
Dan Berrebbi601.35
Chao Weng711319.75
Shinji Watanabe81158139.38
Dong Yu96264475.73