Biomimetic Multi-Resolution Analysis for Robust Speaker Recognition. - Citegraph

Paper Info

Title
Biomimetic Multi-Resolution Analysis for Robust Speaker Recognition.

Abstract
Humans exhibit a remarkable ability to reliably classify sound sources in the environment even in presence of high levels of noise. In contrast, most engineering systems suffer a drastic drop in performance when speech signals are corrupted with channel or background distortions. Our brains are equipped with elaborate machinery for speech analysis and feature extraction, which hold great lessons for improving the performance of automatic speech processing systems under adverse conditions. The work presented here explores a biologically-motivated multi-resolution speaker information representation obtained by performing an intricate yet computationally-efficient analysis of the information-rich spectro-temporal attributes of the speech signal. We evaluate the proposed features in a speaker verification task performed on NIST SRE 2010 data. The biomimetic approach yields significant robustness in presence of non-stationary noise and reverberation, offering a new framework for deriving reliable features for speaker recognition and speech processing.

Year	DOI	Venue
2012	10.1186/1687-4722-2012-22	EURASIP J. Audio, Speech and Music Processing
Keywords	Field	DocType
Speech Signal, Equal Error Rate, Speaker Recognition, Speaker Verification, Universal Background Model	Speech processing,Reverberation,Pattern recognition,Computer science,Voice activity detection,Communication channel,Feature extraction,Speech recognition,Robustness (computer science),Speaker recognition,NIST,Artificial intelligence	Journal
Volume	Issue	ISSN
2012	1	1687-4722
Citations	PageRank	References
3	0.41	12
Authors
4

Authors (4 rows)

Cited by (3 rows)

References (12 rows)

Name	Order	Citations	PageRank
Sridhar Krishna Nemala	1	54	5.36
Dmitry N. Zotkin	2	171	19.06
Ramani Duraiswami	3	1721	161.98
Mounya Elhilali	4	136	19.02

1