A comparative study on various confidence measures in large vocabulary speech recognition - Citegraph

Paper Info

Title
A comparative study on various confidence measures in large vocabulary speech recognition

Abstract
In this paper, we have conducted a comparative study on several confidence measures (CM) for large vocabulary speech recognition. Firstly, we propose a novel high-level CM that is based on the inter-word mutual information (MI). Secondly, we experimentally investigate several popular low-level CM, such as word posterior probabilities, N-best counting, likelihood ratio testing (LRT), etc. Finally, we have studied a simple linear interpolation strategy to combine the best low-level CM with the best high-level CM. All of these CM are examined in two large vocabulary ASR tasks, namely the Switchboard task and a Mandarin dictation task, to verify the recognition errors in baseline recognition systems. Experimental results show: (1) the proposed MI-based CM greatly surpass another existing high-level CM which are based on the LSA technique; (2) among all low-level CM, word posteriori probabilities give the best verification performance; (3) when combining the word posteriori probabilities with the MI-based CM, the equal error rate is reduced from 24.4% to 23.9% in the Switchboard task and from 17.5% to 16.2% in the Mandarin dictation task.

Year	DOI	Venue
2004	10.1109/CHINSL.2004.1409573	ISCSLP
Keywords	Field	DocType
switchboard task,speech recognition,interpolation,maximum likelihood estimation,vocabulary,lsa technique,large vocabulary speech recognition,equal error rate,word posterior probabilities,high-level confidence measures,inter-word mutual information,linear interpolation,asr,verification performance,likelihood ratio testing,n-best counting,baseline recognition systems,recognition errors,error statistics,mandarin dictation task,posterior probability,likelihood ratio test,comparative study,mutual information	Pattern recognition,Computer science,Interpolation,Word error rate,Speech recognition,Posterior probability,Dictation,Mutual information,Artificial intelligence,Linear interpolation,Vocabulary,Mandarin Chinese	Conference
ISBN	Citations	PageRank
0-7803-8678-7	16	1.02
References	Authors
5	4

Authors (4 rows)

Cited by (16 rows)

References (5 rows)

Name	Order	Citations	PageRank
Gang Guo	1	16	1.02
Chao Huang	2	218	23.06
Hui Jiang	3	1493	113.16
Ren-Hua Wang	4	344	41.36

1