Title
Calibration and multiple system fusion for spoken term detection using linear logistic regression.
Abstract
State-of-the-art calibration and fusion approaches for spoken term detection (STD) systems currently rely on a multi-pass approach where the scores are calibrated, then fused, and finally re-calibrated to obtain a single decision threshold across keywords. While the above techniques are theoretically correct, they rely on meta-parameter tuning and are prone to over-fitting. This study presents an efficient and effective score calibration technique for keyword detection that is based on the logistic regression calibration approach commonly used in forensic speaker identification. The technique applies seamlessly to both single systems and to system fusion, and enables optimization for specific keyword detection evaluation functions. We run experiments on a Vietnamese STD task, comparing the technique with more empirical calibration and fusion schemes and demonstrate that we can achieve comparable or better performance in terms of the NIST ATWV metric with a more elegant solution.
Year
DOI
Venue
2014
10.1109/ICASSP.2014.6854985
ICASSP
Keywords
Field
DocType
logistics,optimization,speaker recognition,vectors,calibration,speech,over fitting,hidden markov models,regression analysis,nist,training data
Speaker identification,Pattern recognition,Computer science,Fusion,NIST,Artificial intelligence,Logistic regression,Machine learning,Calibration
Conference
ISSN
Citations 
PageRank 
1520-6149
4
0.44
References 
Authors
10
7
Name
Order
Citations
PageRank
Julien van Hout1546.07
Luciana Ferrer259748.55
Dimitra Vergyri337336.97
Nicolas Scheffer435423.77
Yun Lei530621.55
Vikramjit Mitra629924.83
Steven Wegmann740.44