Title | ||
---|---|---|
Improving Keyword Recognition of Spoken Queries by Combining Multiple Speech Recognizer's Outputs for Speech-driven WEB Retrieval Task |
Abstract | ||
---|---|---|
This paper presents speech-driven Web retrieval models which accept spoken search topics (queries) in the NTCIR-3 Web retrieval task. The major focus of this paper is on improving speech recognition accuracy of spoken queries and then improving retrieval accuracy in speechdriven Web retrieval. We experimentally evaluated the techniques of combining outputs of multiple LVCSR models in recognition of spoken queries. As model combination techniques, we compared the SVM learning technique with conventional voting schemes such as ROVER. In addition, for investigating the effects on the retrieval performance in vocabulary size of the language model, we prepared two kinds of language models: the one's vocabulary size was 20,000, the other's one was 60,000. Then, we evaluated the differences in the recognition rates of the spoken queries and the retrieval performance. We showed that the techniques of multiple LVCSR model combination could achieve improvement both in speech recognition and retrieval accuracies in speech-driven text retrieval. Comparing with the retrieval accuracies when an LM with a 20,000/60,000 vocabulary size is used in an LVCSR system, we found that the larger the vocabulary size is, the better the retrieval accuracy is. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1093/ietisy/e88-d.3.472 | IEICE Transactions |
Keywords | Field | DocType |
vocabulary size,retrieval accuracy,speech recognition,speechdriven web retrieval,spoken queries,language model,retrieval performance,speech-driven web retrieval task,recognition rate,speech-driven web retrieval model,ntcir-3 web retrieval task,improving keyword recognition,combining multiple speech recognizer,speech-driven text retrieval | Web retrieval,Human–computer information retrieval,Pattern recognition,Voting,Computer science,Support vector machine,Speech recognition,Natural language processing,Artificial intelligence,Vocabulary,Text retrieval,Language model | Journal |
Volume | Issue | ISSN |
E88-D | 3 | 1745-1361 |
Citations | PageRank | References |
0 | 0.34 | 10 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Masahiko Matsushita | 1 | 6 | 1.33 |
Hiromitsu Nishizaki | 2 | 163 | 29.49 |
takehito utsuro | 3 | 456 | 82.76 |
Seiichi Nakagawa | 4 | 598 | 104.03 |