Title
Adaptive Regularization Framework For Robust Voice Activity Detection
Abstract
Traditional VAD algorithms work well under clean conditions, their performance however decreases drastically in noisy environments. We have investigated the tradeoff between false acceptance rate (FAR) and false rejection rate (FRR) in VAD with the consideration of noise reduction and speech distortion problem in speech enhancement, and proposed a regularization framework for noise reduction in designing VAD algorithms. In the framework, the balance between FAR and FRR was implicitly controlled by using a regularization parameter. In addition, the regularization was done in a reproducing kernel Hilbert space (RKHS) which made it easy to apply a non-linear transform function via "kernel trick" for noise reduction. Under this framework, a better tradeoff between FAR and FRR was obtained in VAD. Considering the non-stationarity property of speech and noise, in this study, an adaptive regularization framework was further developed in which the regularization parameter was changed adaptively according to local variations of the signal to noise ratio (SNR). We tested our algorithm on VAD experiments, and compared it with several typical VAD algorithms. The results showed that the proposed algorithm could be used to improve the robustness of VAD.
Year
Venue
Keywords
2011
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5
Noise reduction, voice activity detection, reproducing kernel Hilbert space
Field
DocType
Citations 
Pattern recognition,Computer science,Voice activity detection,Speech recognition,Artificial intelligence,Adaptive regularization
Conference
5
PageRank 
References 
Authors
0.60
1
5
Name
Order
Citations
PageRank
Xugang Lu118033.11
Masashi Unoki213846.07
Ryosuke Isotani33810.60
Hisashi Kawai425054.04
Satoshi Nakamura51099194.59