Title
Speech-Codebook Based Soft Voice Activity Detection
Abstract
A novel noise-robust soft Voice Activity Detector (VAD) operating in the short-time Fourier domain is presented. A speech energy gain is obtained by frame-wise processing of a noisy speech signal with a speech codebook algorithm, and this gain can be used for robust voice detection. A speaker-independent speech codebook consisting of spectral envelopes is created in the training process. When the algorithm is applied, the codebook is adapted in every frame to the current speaker by combining the harmonic pitch structure of the current noisy speech frame with the codebook entries. Soft VAD values ranging from zero to one are calculated by post-processing the speech gain, which is obtained using gain-shape vector quantization. A binary VAD decision is obtained by applying a threshold. The proposed method does not rely on a priori knowledge of the noise and is robust with respect to highly non-stationary noise and adverse SNR conditions. In addition, the detection rate can be traded off against the false-alarm rate by varying the threshold without increasing the total number of misdetections. Compared to state-of-the-art VAD systems, the proposed method achieves better detection rates at significantly lower false-alarm rates.
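The gain-shape vector quantization and thresholding steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the post-processing, so the soft value here is assumed to be the fraction of frame energy explained by the best-matching codebook shape, and the per-frame pitch adaptation of the codebook is omitted. The function names `gain_shape_vq`, `soft_vad`, and `binary_vad`, and the default threshold of 0.5, are hypothetical.

```python
import numpy as np


def gain_shape_vq(frame, codebook):
    """Gain-shape VQ: pick the unit-norm shape that best matches the frame.

    frame:    1-D array, e.g. a magnitude spectrum of one noisy frame.
    codebook: 2-D array, one spectral envelope per row (assumed non-zero rows).
    Returns (index of the best shape, non-negative gain).
    """
    # Normalize each codebook row so that it only carries shape information.
    shapes = codebook / np.linalg.norm(codebook, axis=1, keepdims=True)
    # For unit-norm shapes, the optimal gain is the inner product, and the
    # shape with the largest gain minimizes the squared quantization error.
    gains = shapes @ frame
    idx = int(np.argmax(gains))
    return idx, max(float(gains[idx]), 0.0)


def soft_vad(frame, codebook, eps=1e-12):
    """Assumed post-processing: explained-energy ratio in [0, 1]."""
    _, gain = gain_shape_vq(frame, codebook)
    energy = float(frame @ frame)
    # By the Cauchy-Schwarz inequality gain**2 <= energy; eps avoids 0/0.
    return min(gain ** 2 / (energy + eps), 1.0)


def binary_vad(soft_value, threshold=0.5):
    """Binary decision obtained by thresholding the soft VAD value."""
    return soft_value >= threshold
```

Raising the threshold in `binary_vad` lowers the false-alarm rate at the cost of the detection rate, which is the trade-off the abstract describes; the particular soft measure used here is only one plausible choice.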
Year
2015
Venue
2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Keywords
Voice activity detection, Codebook, Noise robust
Field
Speech processing, Speech coding, Noise measurement, Pattern recognition, Voice activity detection, Computer science, Speech recognition, Vector quantization, Artificial intelligence, Codec2, Linear predictive coding, Codebook
DocType
Conference
ISSN
1520-6149
Citations
2
PageRank
0.38
References
7
Authors
3
Name | Order | Citations | PageRank
Florian Heese | 1 | 50 | 5.25
Markus Niermann | 2 | 2 | 0.38
Peter Vary | 3 | 852 | 75.52