Title
Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems
Abstract
Automatic speaker verification (ASV) systems use a playback detector to filter out playback attacks and ensure verification reliability. Since current playback detection models are almost always trained using genuine and playedback speech, it may be possible to degrade their performance by transforming the acoustic characteristics of the played-back speech close to that of the genuine speech. One way to do this is to enhance speech “stolen” from the target speaker before playback. We tested the effectiveness of a playback attack using this method by using the speech enhancement generative adversarial network to transform acoustic characteristics. Experimental results showed that use of this “enhanced stolen speech” method significantly increases the equal error rates for the baseline used in the ASVspoof 2017 challenge and for a light convolutional neural network-based method. The results also showed that its use degrades the performance of a Gaussian mixture modeluniversal background model-based ASV system. This type of attack is thus an urgent problem needing to be solved.
Year
DOI
Venue
2018
10.1109/WIFS.2018.8630764
2018 IEEE International Workshop on Information Forensics and Security (WIFS)
Keywords
DocType
Volume
Gaussian mixture model,speech enhancement,generative adversarial network,playback detection models,universal background model-based ASV system,convolutional neural network,target speaker,playedback speech,genuine speech,playback attack,playback detector,automatic speaker verification systems,playback spoofing countermeasures,enhanced stolen speech method,acoustic characteristics
Conference
abs/1809.04274
ISSN
ISBN
Citations 
2157-4766
978-1-5386-6537-4
1
PageRank 
References 
Authors
0.37
25
5
Name
Order
Citations
PageRank
Fuming Fang110.37
junichi yamagishi21906145.51
Isao Echizen329968.82
Md. Sahidullah432624.99
Tomi Kinnunen5132386.67