Title | ||
---|---|---|
Efficient data selection for spoken document retrieval based on prior confidence estimation using speech and context independent models |
Abstract | ||
---|---|---|
This paper proposes an efficient speech sample selection technique that can identify those samples that will be well recognized. Conventional confidence measures can identify well-recognized speech samples, but they require speech recognition to estimate confidence scores. Speech samples with low confidence should not undergo recognition since they yield speech documents that will eventually be rejected. The proposed technique can select the samples that will justify the application of speech recognition. It is based on rapid prior confidence estimation by using speech and context independent models to calculate acoustic likelihood values on a frame-by-frame basis. Tests show that the proposed confidence estimation technique is over 50 times faster than the conventional posterior confidence measure while maintaining equivalent data selection performance for speech recognition and spoken document retrieval. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/SLT.2010.5700851 | Spoken Language Technology Workshop |
Keywords | DocType | ISBN |
document handling,information retrieval,speech recognition,acoustic likelihood values,confidence estimation,context independent model,data selection,speech independent model,speech sample selection technique,spoken document retrieval,confidence measure | Conference | 978-1-4244-7902-3 |
Citations | PageRank | References |
0 | 0.34 | 17 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Satoshi Kobashikawa | 1 | 28 | 9.73 |
Taichi Asami | 2 | 22 | 10.49 |
Yoshikazu Yamaguchi | 3 | 77 | 11.18 |
Hirokazu Masataki | 4 | 18 | 9.21 |
takahashi satoshi | 5 | 0 | 0.34 |
ntt cyber | 6 | 6 | 1.62 |