Title
I-vector based language modeling for spoken document retrieval
Abstract
Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. The i-vector based framework has been proposed and introduced to language identification (LID) and speaker recognition (SR) tasks recently. The major contribution of the i-vector framework is to reduce a series of acoustic feature vectors of a speech utterance to a low-dimensional vector representation, and then numbers of well-developed postprocessing techniques (such as probabilistic linear discriminative analysis, PLDA) can be readily and effectively used. However, to our best knowledge, there is no research up to date on applying the i-vector framework for SDR or information retrieval (IR). In this paper, we make a step forward to formulate an i-vector based language modeling (IVLM) framework for SDR. Furthermore, we evaluate the proposed IVLM framework with both inductive and transductive learning strategies. We also exploit multi-levels of index features, including word- and subword-level units, in concert with the proposed framework. The results of SDR experiments conducted on the TDT-2 (Topic Detection and Tracking) collection demonstrate the performance merits of our proposed framework when compared to several existing approaches.
Year
DOI
Venue
2014
10.1109/ICASSP.2014.6854974
ICASSP
Keywords
Field
DocType
inductive,lid,transductive learning strategy,language modeling,i-vector,tdt-2 collection,plda,i-vector based framework,language identification,information retrieval,speech utterance,learning by example,speaker recognition,sdr,spoken document retrieval,speaker recognition task,ivlm framework,postprocessing techniques,transductive,subword-level unit,topic detection and tracking collection,acoustic feature vector,i-vector based language modeling framework,inductive learning strategy,multimedia data,probabilistic linear discriminative analysis,low-dimensional vector representation,probability,semantics,probabilistic logic,vectors,indexes
Transduction (machine learning),Divergence-from-randomness model,Feature vector,Pattern recognition,Computer science,Speaker recognition,Language identification,Natural language processing,Artificial intelligence,Vector space model,Document retrieval,Language model
Conference
ISSN
Citations 
PageRank 
1520-6149
5
0.40
References 
Authors
16
5
Name
Order
Citations
PageRank
Kuan-Yu Chen1192.97
Hung-Shin Lee2539.76
Hsin-min Wang31201129.62
Berlin Chen415134.59
Hsin-hsi Chen52267233.93