Title
On The Use Of Blind Channel Response Estimation And A Residual Neural Network To Detect Physical Access Attacks To Speaker Verification Systems
Abstract
Spoofing attacks have been acknowledged as a serious threat to automatic speaker verification (ASV) systems. In this paper, we are specifically concerned with replay attack scenarios. As a countermeasure to the problem, we propose a front-end based on the blind estimation of the channel response magnitude and as a back-end a residual neural network. Our hypothesis is that the magnitude response of the channel, obtained by subtracting the log-magnitude spectrum of the observed signal from the prediction of the log-magnitude spectrum average of the observed signal's clean counterpart, will capture the nuances of room ambiences, recordings and playback devices. The performance of these features is investigated on a benchmark back-end, based on a Gaussian mixture model and on a deep neural network classifier. Our experiments are performed on the 2017 and 2019 Automatic Speaker Verification Spoofing and Countermeasures Challenge (ASVspoof) datasets. The benchmark systems are the same as used in the challenges and are based on constant-Q cepstral coefficients (CQCC) and linear-frequency cepstral coefficients (LFCC) features. Experimental results on the 2017 dataset show that the proposed method outperforms the two benchmarks, providing equal-error rates (EER) as low as 7.57% and 11.64%, respectively, for the development and evaluation sets. On the ASVspoof 2019 dataset, in turn, the proposed method outperformed the benchmark using a residual neural network as back-end by yielding tandem detection cost function (t-DCF) and EER as low as 0.1086 and 4.26% on the evaluation set. Lastly, an instrumental (objective) quality assessment is performed on the two datasets and the impact of quality variability on spoofing detection accuracy is discussed. Crown Copyright (C) 2020 Published by Elsevier Ltd. All rights reserved.
Year
DOI
Venue
2021
10.1016/j.csl.2020.101163
COMPUTER SPEECH AND LANGUAGE
Keywords
DocType
Volume
Automatic speaker recognition, Spoofing attacks, Replay attack, Channel estimation
Journal
66
ISSN
Citations 
PageRank 
0885-2308
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Anderson R. Avila132.77
jahangir alam232038.69
Fabiano O. Costa Prado300.34
Douglas D. O'Shaughnessy439884.79
Tiago H. Falk552565.20