Spoken Document Confidence Estimation Using Contextual Coherence - Citegraph

Paper Info

Title
Spoken Document Confidence Estimation Using Contextual Coherence

Abstract
Selecting well-recognized transcripts is critical if information retrieval systems are to extract business intelligence from massive spoken document databases. To achieve this goal, we target spoken document confidence measures that represent the recognition rates of each document. We focus on the incoherent word occurrences over several utterances in ill-recognized transcripts of spoken documents. The proposed method uses contextual coherence as a measure of spoken document confidence. The contextual coherence is formulated as the mean of pointwise mutual information (PMI). We also propose a smoothing method of PMI, which deals with the data sparseness problem. Compared to the conventional method, our smoothing technique offers improved correlation coefficients between spoken document confidence scores and recognition rates from 0.573 to 0.672. Moreover, an even higher correlation coefficient, 0.710, is achieved by combining the contextual-based and decoder-based confidence measures.

Year	Venue	Keywords
2011	12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5	speech recognition, confidence measures, spoken documents, contextual coherence
Field	DocType	Citations
Pattern recognition,Computer science,Speech recognition,Coherence (physics),Natural language processing,Artificial intelligence	Conference	5
PageRank	References	Authors
0.48	1	6

Authors (6 rows)

Cited by (5 rows)

References (1 rows)

Name	Order	Citations	PageRank
Taichi Asami	1	22	10.49
Narichika Nomoto	2	6	0.84
Satoshi Kobashikawa	3	28	9.73
Yoshikazu Yamaguchi	4	77	11.18
Hirokazu Masataki	5	18	9.21
Satoshi Takahashi	6	5	0.48

1