Title | ||
---|---|---|
Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task |
Abstract | ||
---|---|---|
This paper studies speech-driven Web retrieval models which accepts spoken search topics (queries) in the NTCIR-3 Web retrieval task. The major focus of this paper is on improving speech recognition accuracy of spoken queries and then im- proving retrieval accuracy in speech-driven Web retrieval. We experimentally evaluate the techniques of combining outputs of multiple LVCSR models in recognition of spoken queries. As model combination techniques, we compare the SVM learning technique and conventional voting schemes such as ROVER. We show that the techniques of multiple LVCSR model com- bination can achieve improvement both in speech recognition and retrieval accuracies in speech-driven text retrieval. We also show that model combination by SVM learning outperforms conventional voting schemes both in speech recognition and re- trieval accuracies. |
Year | Venue | Keywords |
---|---|---|
2003 | INTERSPEECH | speech recognition |
Field | DocType | Citations |
Web retrieval,Human–computer information retrieval,Voting,Pattern recognition,Computer science,Support vector machine,Image retrieval,Speech recognition,Artificial intelligence,Text retrieval | Conference | 4 |
PageRank | References | Authors |
0.59 | 10 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Masahiko Matsushita | 1 | 6 | 1.33 |
Hiromitsu Nishizaki | 2 | 163 | 29.49 |
takehito utsuro | 3 | 456 | 82.76 |
Yasuhiro Kodama | 4 | 19 | 2.87 |
Seiichi Nakagawa | 5 | 598 | 104.03 |