Abstract | ||
---|---|---|
Selecting well-recognized transcripts is critical if information retrieval systems are to extract business intelligence from massive spoken document databases. To achieve this goal, we target spoken document confidence measures that represent the recognition rates of each document. We focus on the incoherent word occurrences over several utterances in ill-recognized transcripts of spoken documents. The proposed method uses contextual coherence as a measure of spoken document confidence. The contextual coherence is formulated as the mean of pointwise mutual information (PMI). We also propose a smoothing method of PMI, which deals with the data sparseness problem. Compared to the conventional method, our smoothing technique offers improved correlation coefficients between spoken document confidence scores and recognition rates from 0.573 to 0.672. Moreover, an even higher correlation coefficient, 0.710, is achieved by combining the contextual-based and decoder-based confidence measures. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | speech recognition, confidence measures, spoken documents, contextual coherence |
Field | DocType | Citations |
Pattern recognition,Computer science,Speech recognition,Coherence (physics),Natural language processing,Artificial intelligence | Conference | 5 |
PageRank | References | Authors |
0.48 | 1 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Taichi Asami | 1 | 22 | 10.49 |
Narichika Nomoto | 2 | 6 | 0.84 |
Satoshi Kobashikawa | 3 | 28 | 9.73 |
Yoshikazu Yamaguchi | 4 | 77 | 11.18 |
Hirokazu Masataki | 5 | 18 | 9.21 |
Satoshi Takahashi | 6 | 5 | 0.48 |