Speechfind for CDP: Advances in spoken document retrieval for the U. S. collaborative digitization program - Citegraph

Paper Info

Title
Speechfind for CDP: Advances in spoken document retrieval for the U. S. collaborative digitization program

Abstract
This paper presents our recent advances for SpeechFind, a CRSS-UTD designed spoken document retrieval system for the U.S. based Collaborative Digitization Program (CDP). A proto-type of SpeechFind for the CDP is currently serving as the search engine for 1,300 hours of CDP audio content which contain a wide range of acoustic conditions, vocabulary and period selection, and topics. In an effort to determine the amount of user corrected transcripts needed to impact automatic speech recognition (ASR) and audio search, a web-based online interface for verification of ASR-generated transcripts was developed. The procedure for enhancing the transcription performance for SpeechFind is also presented. A selection of adaptation methods for language and acoustic models are employed depending on the acoustics of the corpora under test. Experimental results on the CDP corpus demonstrate that the employed model adaptation scheme using the verified transcripts is effective in improving recognition accuracy. Through a combination of feature/acoustic model enhancement and language model selection, up to 24.8% relative improvement in ASR was obtained. The SpeechFind system, employing automatic transcript generation, online CDP transcript correction, and our transcript reliability estimator, demonstrates a comprehensive support mechanism to ensure reliable transcription and search for U.S. libraries with limited speech technology experience.

Year	DOI	Venue
2007	10.1109/ASRU.2007.4430195	ASRU
Keywords	Field	DocType
speechfind,audio search,topics,speech processing,crss-utd designed spoken document retrieval system,speech recognition,online front-ends,audio indexing,audio acoustics,vocabulary,transcript verification,period selection,cdp audio content,speech-based user interfaces,acoustic conditions,indexing,search engine,us based collaborative digitization program,cdp,internet,spoken document retrieval,asr-generated transcript verification,speechfind system,model enhancement,ngsw,search engines,web-based online interface,automatic speech recognition,language model	Speech processing,Digitization,Search engine,Computer science,Search engine indexing,Speech recognition,Artificial intelligence,Natural language processing,Document retrieval,Vocabulary,Language model,Acoustic model	Conference
ISBN	Citations	PageRank
978-1-4244-1746-9	4	0.41
References	Authors
6	2

Authors (2 rows)

Cited by (4 rows)

References (6 rows)

Name	Order	Citations	PageRank
Wooil Kim	1	120	16.95
John H. L. Hansen	2	3215	365.75

1