Title | ||
---|---|---|
Speaker diarization using unsupervised discriminant analysis of inter-channel delay features |
Abstract | ||
---|---|---|
When multiple microphones are available, estimates of inter-channel delay, which characterise a speaker's location, can be used as features for speaker diarization. Background noise and reverberation can, however, lead to noisy features and poor performance. To ameliorate these problems, this paper presents a new approach to the discriminant analysis of delay features for speaker diarization. This novel and nonetheless unsupervised approach aims to increase speaker separability in delay-space. We assess the approach on subsets of four standard NIST RT datasets and demonstrate a relative improvement in diarization error rate of 25% on a separate evaluation set using delay features alone. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1109/ICASSP.2009.4960520 | ICASSP |
Keywords | Field | DocType |
loudspeakers,speaker diarization,speaker recognition,feature extraction,viterbi algorithm,principal component analysis,nist,background noise,error rate,speech,discriminant analysis,reverberation,acoustics | Reverberation,Background noise,Pattern recognition,Computer science,Communication channel,Speech recognition,NIST,Speaker recognition,Speaker diarisation,Artificial intelligence,Linear discriminant analysis,Principal component analysis | Conference |
ISSN | Citations | PageRank |
1520-6149 | 9 | 0.59 |
References | Authors | |
5 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nicholas Evans | 1 | 594 | 54.41 |
Corinne Fredouille | 2 | 537 | 44.53 |
Jean-François Bonastre | 3 | 106 | 13.02 |