Title
Generalized end-to-end detection of spoofing attacks to automatic speaker recognizers
Abstract
As automatic speaker recognizer systems become mainstream, voice spoofing attacks are on the rise. Common attack strategies include replay, the use of text-to-speech synthesis, and voice conversion systems. While previously-proposed end-to-end detection frameworks have shown to be effective in spotting attacks for one particular spoofing strategy, they have relied on different models, architectures, and speech representations, depending on the spoofing strategy. In practice, however, one does not have a priori information regarding the strategy an attacker might employ to fool a speaker recognizer, thus it is necessary to devise approaches which are able to detect attacks regardless of the strategy employed to generate them. In this work, we introduce an end-to-end ensemble based approach such that two models – previously shown to perform well on each considered attack strategy – are trained jointly, while a third model learns how to mix their outputs yielding a single score. Experimental results with replay and text-to-speech/voice conversion attacks show the proposed ensemble method achieving similar or superior performance when compared to systems specialized on each spoofing strategy separately.
Year
DOI
Venue
2020
10.1016/j.csl.2020.101096
Computer Speech & Language
Keywords
DocType
Volume
Voice biometrics,Presentation attacks detection,Speaker verification,Convolutional neural networks
Journal
63
ISSN
Citations 
PageRank 
0885-2308
1
0.36
References 
Authors
0
3
Name
Order
Citations
PageRank
João Bosco Oliveira Monteiro1248.87
jahangir alam232038.69
Tiago H. Falk352565.20