Title
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling.
Abstract
Automatic speech recognition (ASR) systems often make unrecoverable errors due to subsystem pruning (acoustic, language and pronunciation models); for example, pruning words due to acoustics using short-term context, prior to rescoring with long-term context based on linguistics. In this work, we model ASR as a phrase-based noisy transformation channel and propose an error correction system that can learn from the aggregate errors of all the independent modules constituting the ASR and attempt to invert those. The proposed system can exploit long-term context using a neural network language model and can better choose between existing ASR output possibilities as well as re-introduce previously pruned or unseen (Out-Of-Vocabulary) phrases. It provides corrections under poorly performing ASR conditions without degrading any accurate transcriptions; such corrections are greater on top of out-of-domain and mismatched data ASR. Our system consistently provides improvements over the baseline ASR, even when baseline is further optimized through Recurrent Neural Network (RNN) language model rescoring. This demonstrates that any ASR improvements can be exploited independently and that our proposed system can potentially still provide benefits on highly optimized ASR. Finally, we present an extensive analysis of the type of errors corrected by our system.
Year
2018
DOI
10.1017/ATSIP.2018.31
Venue
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING
Keywords
Error correction, Speech recognition, Phrase-based context modeling, Noise channel estimation, Neural Network Language Model
Field
Computer science, Communication channel, Phrase, Error detection and correction, Speech recognition, Context model, Artificial intelligence, Test data, Decoding methods, Artificial neural network, Language model, Machine learning
DocType
Journal
Volume
8
ISSN
2048-7703
Citations
2
PageRank
0.47
References
28
Authors
4
Name | Order | Citations | PageRank
Prashanth Gurunath Shivakumar | 1 | 2 | 0.47
Haoqi Li | 2 | 4 | 5.74
Kevin Knight | 3 | 5096 | 462.44
Georgiou Panayiotis | 4 | 428 | 55.79