Title
Speaker Embedding Augmentation with Noise Distribution Matching
Abstract
Data augmentation (DA) is an effective strategy to help building robust systems with good generalization ability. In the embedding based speaker verification, data augmentation could be applied to either the front-end embedding extractor or the back-end PLDA. Unlike the conventional back-end augmentation method which adds noises to the raw audios and then extracts augmented embeddings, in this work, we proposed a noise distribution matching (NDM) based algorithm in the speaker embedding space. The basic idea is to use distributions such as Gaussian to model the difference between the clean and original augmented noisy speaker embeddings. Experiments are carried out on SRE16 dataset, where consistent performance improvement could be obtained by the novel NDM. Furthermore, we found that the proposed NDM could be robustly estimated using only a small amount of training data, which saves time and disk cost compared to the conventional augmentation method.
Year
DOI
Venue
2021
10.1109/ISCSLP49672.2021.9362090
2021 12th International Symposium on Chinese Spoken Language Processing (ISCSLP)
Keywords
DocType
ISBN
speaker embedding,data augmentation,distribution matching,speaker verification
Conference
978-1-7281-6995-8
Citations 
PageRank 
References 
0
0.34
0
Authors
6
Name
Order
Citations
PageRank
Xun Gong100.34
Zhengyang Chen285.17
Yexin Yang312.04
Shuai Wang400.34
Lan Wang500.68
Yanmin Qian629544.44