Title
Efficient data selection for spoken document retrieval based on prior confidence estimation using speech and context independent models
Abstract
This paper proposes an efficient speech sample selection technique that can identify those samples that will be well recognized. Conventional confidence measures can identify well-recognized speech samples, but they require speech recognition to estimate confidence scores. Speech samples with low confidence should not undergo recognition since they yield speech documents that will eventually be rejected. The proposed technique can select the samples that will justify the application of speech recognition. It is based on rapid prior confidence estimation by using speech and context independent models to calculate acoustic likelihood values on a frame-by-frame basis. Tests show that the proposed confidence estimation technique is over 50 times faster than the conventional posterior confidence measure while maintaining equivalent data selection performance for speech recognition and spoken document retrieval.
Year
DOI
Venue
2010
10.1109/SLT.2010.5700851
Spoken Language Technology Workshop
Keywords
DocType
ISBN
document handling,information retrieval,speech recognition,acoustic likelihood values,confidence estimation,context independent model,data selection,speech independent model,speech sample selection technique,spoken document retrieval,confidence measure
Conference
978-1-4244-7902-3
Citations 
PageRank 
References 
0
0.34
17
Authors
6
Name
Order
Citations
PageRank
Satoshi Kobashikawa1289.73
Taichi Asami22210.49
Yoshikazu Yamaguchi37711.18
Hirokazu Masataki4189.21
takahashi satoshi500.34
ntt cyber661.62