Title
Learning Discriminative Speaker Embedding by Improving Aggregation Strategy and Loss Function for Speaker Verification
Abstract
The embedding-based speaker verification (SV) technology has witnessed significant progress due to the advances of deep convolutional neural networks (DCNN). However, how to improve the discrimination of speaker embedding in the open world SV task is still the focus of current research in the community. In this paper, we improve the discriminative power of speaker embedding from three-fold: (1) NeXtVLAD is introduced to aggregate frame-level features, which decomposes the high-dimensional frame-level features into a group of low-dimensional vectors before applying VLAD aggregation. (2) A multi-scale aggregation strategy (MSA) assembled with NeXtVLAD is designed with the purpose of fully extract speaker information from the frame-level feature in different hidden layers of DCNN. (3) A mutually complementary assembling loss function is proposed to train the model, which consists of a prototypical loss and a marginal-based softmax loss. Extensive experiments have been conducted on the VoxCeleb-1 dataset, and the experimental results show that our proposed system can obtain significant performance improvements compared with the baseline, and obtains new state-of-the-art results. The source code of this paper is available at https://github.com/LCF2764/Discriminative-Speaker-Embedding.
Year
DOI
Venue
2021
10.1109/IJCB52358.2021.9484331
2021 IEEE International Joint Conference on Biometrics (IJCB)
Keywords
DocType
ISSN
Discriminative Speaker Embedding,improving aggregation strategy,embedding-based speaker verification technology,deep convolutional neural networks,DCNN,open world SV task,discriminative power,frame-level feature,high-dimensional frame-level features,low-dimensional vectors,VLAD aggregation,multiscale aggregation strategy,speaker information,mutually complementary assembling loss function,prototypical loss,marginal-based softmax loss
Conference
2474-9680
ISBN
Citations 
PageRank 
978-1-6654-3781-3
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Chengfang Luo100.34
Xin Guo200.34
Aiwen Deng300.34
Wei Xu400.34
Junhong Zhao5277.02
Wenxiong Kang6386.66