A Comparison Of Features For Synthetic Speech Detection - Citegraph

Paper Info

Title
A Comparison Of Features For Synthetic Speech Detection

Abstract
The performance of biometric systems based on automatic speaker recognition technology is severely degraded due to spoofing attacks with synthetic speech generated using different voice conversion (VC) and speech synthesis (SS) techniques. Various countermeasures are proposed to detect this type of attack, and in this context, choosing an appropriate feature extraction technique for capturing relevant information from speech is an important issue. This paper presents a concise experimental review of different features for synthetic speech detection task. A wide variety of features considered in this study include previously investigated features as well as some other potentially useful features for characterizing real and synthetic speech. The experiments are conducted on recently released ASVspoof 2015 corpus containing speech data from a large number of VC and SS technique. Comparative results using two different classifiers indicate that features representing spectral information in high-frequency region, dynamic information of speech, and detailed information related to subband characteristics are considerably more useful in detecting synthetic speech.

Year	Venue	Keywords
2015	16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5	anti-spoofing, ASVspoof 2015, feature extraction, countermeasures
Field	DocType	Citations
Speech synthesis,Spoofing attack,Pattern recognition,Computer science,Voice activity detection,Feature extraction,Speech recognition,Artificial intelligence,Biometrics,Automatic speaker recognition,Anti spoofing	Conference	34
PageRank	References	Authors
1.10	15	3

Authors (3 rows)

Cited by (34 rows)

References (15 rows)

Name	Order	Citations	PageRank
Md. Sahidullah	1	326	24.99
Tomi Kinnunen	2	1323	86.67
Cemal Hanilçi	3	171	11.23

1