Title
Spoken Document Confidence Estimation Using Contextual Coherence
Abstract
Selecting well-recognized transcripts is critical if information retrieval systems are to extract business intelligence from massive spoken document databases. To achieve this goal, we target spoken document confidence measures that represent the recognition rates of each document. We focus on the incoherent word occurrences over several utterances in ill-recognized transcripts of spoken documents. The proposed method uses contextual coherence as a measure of spoken document confidence. The contextual coherence is formulated as the mean of pointwise mutual information (PMI). We also propose a smoothing method of PMI, which deals with the data sparseness problem. Compared to the conventional method, our smoothing technique offers improved correlation coefficients between spoken document confidence scores and recognition rates from 0.573 to 0.672. Moreover, an even higher correlation coefficient, 0.710, is achieved by combining the contextual-based and decoder-based confidence measures.
Year
Venue
Keywords
2011
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5
speech recognition, confidence measures, spoken documents, contextual coherence
Field
DocType
Citations 
Pattern recognition,Computer science,Speech recognition,Coherence (physics),Natural language processing,Artificial intelligence
Conference
5
PageRank 
References 
Authors
0.48
1
6
Name
Order
Citations
PageRank
Taichi Asami12210.49
Narichika Nomoto260.84
Satoshi Kobashikawa3289.73
Yoshikazu Yamaguchi47711.18
Hirokazu Masataki5189.21
Satoshi Takahashi650.48