Title
Spoofing Speech Detection Using High Dimensional Magnitude and Phase Features: The NTU Approach for ASVspoof 2015 Challenge
Abstract
Recent improvements in text-to-speech (TTS) and voice conversion (VC) techniques present a threat to automatic speaker verification (ASV) systems. An attacker can use TTS or VC systems to impersonate a target speaker's voice. To address this threat, we study the detection of such synthetic speech (called spoofing speech) in this paper. We propose to use high dimensional magnitude and phase based features and long term temporal information for the task. In total, 2 types of magnitude based features and 5 types of phase based features are used. For each feature type, we build a component system using a multilayer perceptron to predict the posterior probabilities of the input features being extracted from spoofing speech. The probabilities of all component systems are averaged to produce the score for the final decision. When tested on the ASVspoof 2015 benchmarking task, an equal error rate (EER) of 0.29% is obtained for known spoofing types, which demonstrates the high effectiveness of the 7 features used. For unknown spoofing types, the EER is much higher at 5.23%, suggesting that future research should focus on improving the generalization of the techniques.
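The score fusion and the EER metric described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `fuse_scores` and `equal_error_rate`, the convention that a higher score means "more likely spoofed", and the simple threshold sweep used to estimate the EER are all assumptions, since the abstract only states that component posteriors are averaged and that EER is reported.

```python
import numpy as np


def fuse_scores(posteriors):
    """Average spoofing posteriors over component systems.

    posteriors: array-like of shape (n_systems, n_utterances), where each
    row holds one component system's P(spoof) per utterance (assumed layout).
    """
    posteriors = np.asarray(posteriors, dtype=float)
    return posteriors.mean(axis=0)


def equal_error_rate(scores, labels):
    """Estimate the EER with a threshold sweep (assumed procedure).

    scores: higher means more likely spoofed.
    labels: 1 for spoofed utterances, 0 for genuine ones.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=int)
    spoof = scores[labels == 1]
    genuine = scores[labels == 0]

    # Candidate thresholds: every observed score, plus +inf (reject nothing as spoof).
    thresholds = np.concatenate([np.unique(scores), [np.inf]])

    # Miss rate: spoofed utterances scored below the threshold.
    miss = np.array([(spoof < t).mean() for t in thresholds])
    # False alarm rate: genuine utterances scored at or above the threshold.
    false_alarm = np.array([(genuine >= t).mean() for t in thresholds])

    # The EER is taken where the two error rates are closest.
    idx = np.argmin(np.abs(miss - false_alarm))
    return (miss[idx] + false_alarm[idx]) / 2.0


if __name__ == "__main__":
    # Toy posteriors from two hypothetical component systems, four utterances.
    component_posteriors = [
        [0.9, 0.7, 0.2, 0.1],
        [0.8, 0.6, 0.3, 0.2],
    ]
    labels = [1, 1, 0, 0]
    fused = fuse_scores(component_posteriors)
    print("fused scores:", fused)
    print("EER:", equal_error_rate(fused, labels))
```

With seven component systems, as in the paper, `posteriors` would simply have seven rows; the equal-weight mean mirrors the averaging the abstract describes.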
Year
2015
Venue
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5
Keywords
Spoofing attack, voice conversion, automatic speaker verification, phase feature, ASVspoof 2015
Field
Magnitude (mathematics), Pattern recognition, Spoofing attack, Voice activity detection, Computer science, Speech recognition, Artificial intelligence
DocType
Conference
Citations
19
PageRank
0.62
References
0
Authors
6
Name | Order | Citations | PageRank
Xiong Xiao | 1 | 281 | 34.97
Xiaohai Tian | 2 | 64 | 11.83
Steven Du | 3 | 19 | 0.96
Haihua Xu | 4 | 26 | 2.72
Eng Siong Chng | 5 | 970 | 106.33
Haizhou Li | 6 | 3678 | 334.61