Word embeddings combination and neural networks for robustness in ASR error detection - Citegraph

Paper Info

Title
Word embeddings combination and neural networks for robustness in ASR error detection

Abstract
This study focuses on error detection in Automatic Speech Recognition (ASR) output. We propose to build a confidence classifier based on a neural network architecture, which is in charge to attribute a label (error or correct) for each word within an ASR hypothesis. This classifier uses word embed dings as inputs, in addition to ASR confidence-based, lexical and syntactic features. We propose to evaluate the impact of three different, kinds of word embeddings on this error detection approach, and we present a solution to combine these three different types of word embeddings in order to take advantage of their complementarily. In our experiments, different approaches are evaluated on the automatic transcriptions generated by two different ASR systems applied on the ETAPE corpus (French broadcast news). Experimental results show that the proposed neural architectures achieve a CER reduction comprised between 4% and 5.8% in error detection, depending on test dataset, in comparison with a state-of-the-art CRF approach.

Year	Venue	Keywords
2015	European Signal Processing Conference	Automatic speech recognition,confidence measures,neuronal networks,word embeddings
Field	DocType	ISSN
Broadcasting,Signal processing,Pattern recognition,Computer science,Feature extraction,Robustness (computer science),Error detection and correction,Speech recognition,Artificial intelligence,Classifier (linguistics),Artificial neural network,Syntax	Conference	2076-1465
Citations	PageRank	References
4	0.40	12
Authors
3

Authors (3 rows)

Cited by (4 rows)

References (12 rows)

Name	Order	Citations	PageRank
Sahar Ghannay	1	4	0.40
Yannick Estève	2	298	50.89
Nathalie Camelin	3	39	14.29

1