Title
Utd-Crss Systems For 2016 Nist Speaker Recognition Evaluation
Abstract
This study describes systems submitted by the Center for Robust Speech Systems (CRSS) from the University of Texas at Dallas (UTD) to the 2016 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE). We developed 4 UBM and DNN i-vector based speaker recognition systems with alternate data sets and feature representations. Given that the emphasis of the NIST SRE 2016 is on language mismatch between training and enrollment/test data. so-called domain mismatch. in our system development we focused on: (i) utilizing unlabeled in-domain data for centralizing i-vectors to alleviate the domain mismatch; (ii) selecting the proper data sets and optimizing configurations for training LDA/PLDA; (iii) introducing a newly proposed dimension reduction technique which incorporates unlabeled in-domain data before PLDA training: (iv) unsupervised speaker clustering of unlabeled data and using them alone or with previous SREs for PLDA training, and finally (v) score calibration using unlabeled data with "pseudo" speaker labels generated from speaker clustering. NIST evaluations show that our proposed methods were very successful for the given task.
Year
DOI
Venue
2017
10.21437/Interspeech.2017-555
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
DocType
Volume
NIST SRE, speaker recognition, domain mismatch, i-vector, speaker clustering
Conference
abs/1610.07651
ISSN
Citations 
PageRank 
2308-457X
1
0.35
References 
Authors
0
6
Name
Order
Citations
PageRank
Chunlei Zhang1377.43
Fahimeh Bahmaninezhad242.77
Shivesh Ranjan3636.10
Chengzhu Yu4163.77
Navid Shokouhi5356.43
John H. L. Hansen63215365.75