Title
SpeechFind for CDP: Advances in spoken document retrieval for the U.S. collaborative digitization program
Abstract
This paper presents our recent advances for SpeechFind, a CRSS-UTD-designed spoken document retrieval system for the U.S.-based Collaborative Digitization Program (CDP). A prototype of SpeechFind for the CDP currently serves as the search engine for 1,300 hours of CDP audio content, which contains a wide range of acoustic conditions, vocabulary and period selection, and topics. To determine the amount of user-corrected transcripts needed to impact automatic speech recognition (ASR) and audio search, a web-based online interface for verification of ASR-generated transcripts was developed. The procedure for enhancing the transcription performance of SpeechFind is also presented. A selection of adaptation methods for language and acoustic models is employed, depending on the acoustics of the corpora under test. Experimental results on the CDP corpus demonstrate that the employed model adaptation scheme using the verified transcripts is effective in improving recognition accuracy. Through a combination of feature/acoustic model enhancement and language model selection, up to 24.8% relative improvement in ASR performance was obtained. The SpeechFind system, employing automatic transcript generation, online CDP transcript correction, and our transcript reliability estimator, demonstrates a comprehensive support mechanism to ensure reliable transcription and search for U.S. libraries with limited speech technology experience.
Year: 2007
DOI: 10.1109/ASRU.2007.4430195
Venue: ASRU
Keywords: speechfind,audio search,topics,speech processing,crss-utd designed spoken document retrieval system,speech recognition,online front-ends,audio indexing,audio acoustics,vocabulary,transcript verification,period selection,cdp audio content,speech-based user interfaces,acoustic conditions,indexing,search engine,us based collaborative digitization program,cdp,internet,spoken document retrieval,asr-generated transcript verification,speechfind system,model enhancement,ngsw,search engines,web-based online interface,automatic speech recognition,language model
Field: Speech processing,Digitization,Search engine,Computer science,Search engine indexing,Speech recognition,Artificial intelligence,Natural language processing,Document retrieval,Vocabulary,Language model,Acoustic model
DocType: Conference
ISBN: 978-1-4244-1746-9
Citations: 4
PageRank: 0.41
References: 6
Authors: 2
Name                Order  Citations  PageRank
Wooil Kim           1      120        16.95
John H. L. Hansen   2      3215       365.75