Title
Word embeddings combination and neural networks for robustness in ASR error detection
Abstract
This study focuses on error detection in Automatic Speech Recognition (ASR) output. We propose to build a confidence classifier based on a neural network architecture, which is in charge to attribute a label (error or correct) for each word within an ASR hypothesis. This classifier uses word embed dings as inputs, in addition to ASR confidence-based, lexical and syntactic features. We propose to evaluate the impact of three different, kinds of word embeddings on this error detection approach, and we present a solution to combine these three different types of word embeddings in order to take advantage of their complementarily. In our experiments, different approaches are evaluated on the automatic transcriptions generated by two different ASR systems applied on the ETAPE corpus (French broadcast news). Experimental results show that the proposed neural architectures achieve a CER reduction comprised between 4% and 5.8% in error detection, depending on test dataset, in comparison with a state-of-the-art CRF approach.
Year
Venue
Keywords
2015
European Signal Processing Conference
Automatic speech recognition,confidence measures,neuronal networks,word embeddings
Field
DocType
ISSN
Broadcasting,Signal processing,Pattern recognition,Computer science,Feature extraction,Robustness (computer science),Error detection and correction,Speech recognition,Artificial intelligence,Classifier (linguistics),Artificial neural network,Syntax
Conference
2076-1465
Citations 
PageRank 
References 
4
0.40
12
Authors
3
Name
Order
Citations
PageRank
Sahar Ghannay140.40
Yannick Estève229850.89
Nathalie Camelin33914.29