Title
Variational Bayes based I-vector for speaker diarization of telephone conversations
Abstract
In this paper, we investigate the variational Bayes based I-vector method for speaker diarization of telephone conversations. The motivation of the proposed algorithm is to utilize variational Bayesian framework and exploit potential channel effect of total variability modeling for diarization of conversation side. Other three well-known techniques are compared as follows: K-means clustering for eigenvoices and I-vector speaker diarization, and variational Bayes applied to eigenvoices. Performance evaluations are conducted on the summed-channel telephone data from the 2008 NIST speaker recognition evaluation. The paper discusses how the performance is influenced by different modules, e.g., VAD, initial speaker clustering and Viterbi re-segmentation. Comparison experiments show the interest of variational Bayesian probabilistic framework for speaker diarization.
Year
DOI
Venue
2014
10.1109/ICASSP.2014.6853564
ICASSP
Keywords
Field
DocType
telephone conversations,telephone sets,variational techniques,pattern clustering,summed-channel telephone,vad,bayes methods,potential channel effect,variational bayes,speaker recognition,total variability,variational bayes based i-vector method,eigenvoices,i-vector,viterbi resegmentation,speaker clustering,performance evaluation,speaker diarization,eigenvalues and eigenfunctions,total variability modeling,viterbi detection,speech processing,clustering algorithms,vectors,viterbi algorithm,speech
I vector,Pattern recognition,Computer science,Speech recognition,Speaker recognition,NIST,Artificial intelligence,Speaker diarisation,Cluster analysis,Viterbi algorithm,Bayesian probability,Bayes' theorem
Conference
ISSN
Citations 
PageRank 
1520-6149
3
0.43
References 
Authors
10
4
Name
Order
Citations
PageRank
Rong Zheng1503.50
Ce Zhang2503.17
Shanshan Zhang3534.24
Bo Xu41309.43