Title
Test Collections for Spoken Document Retrieval from Lecture Audio Data.
Abstract
The Spoken Document Processing Working Group, which is part of the special interest group of spoken language processing of the Information Processing Society of Japan, is developing a test collection for evaluation of spoken document retrieval systems. A prototype of the test collection consists of a set of textual queries, relevant segment lists, and transcriptions by an automatic speech recognition system, allowing retrieval from the Corpus of Spontaneous Japanese (CSJ). From about 100 initial queries, 39 were retained by applying the criterion that a query should have more than five relevant segments, each consisting of about one minute of speech. An ad hoc retrieval experiment on the test collection was also conducted to assess baseline retrieval performance using a standard method for spoken document retrieval.
Year
2008
Venue
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008
Field
Transcription (linguistics), Information processing, Spoken language processing, Information retrieval, Computer science, Document processing, Speech recognition, Natural language processing, Artificial intelligence, Document retrieval, Special Interest Group
DocType
Conference
Citations
5
PageRank
0.68
References
11
Authors
9
Name                 Order  Citations  PageRank
Tomoyosi Akiba       1      176        29.08
Kiyoaki Aikawa       2      186        28.87
Yoshiaki Itoh        3      70         14.81
Tatsuya Kawahara     4      1352       196.52
Hiroaki Nanjo        5      128        16.33
Hiromitsu Nishizaki  6      163        29.49
Norihito Yasuda      7      72         12.56
Yoichi Yamashita     8      37         9.26
Katunobu Itou        9      319        44.36