Title
Voice activity detection based on statistical likelihood ratio with adaptive thresholding
Abstract
Statistical likelihood ratio test is a widely used voice activity detection (VAD) method, in which the likelihood ratio of the current temporal frame is compared with a threshold. A fixed threshold is always used, but this is not suitable for various types of noise. In this paper, an adaptive threshold is proposed as a function of the local statistics of the likelihood ratio. This threshold represents the upper bound of the likelihood ratio for the non-speech frames, whereas it remains generally lower than the likelihood ratio for the speech frames. As a result, a high non-speech hit rate can be achieved, while maintaining speech hit rate as large as possible.
Year
DOI
Venue
2016
10.1109/IWAENC.2016.7602911
2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC)
Keywords
Field
DocType
voice activity detection,likelihood ratio test,adaptive threshold,high non-speech hit rate
Hit rate,Likelihood-ratio test,Pattern recognition,Noise measurement,Voice activity detection,Upper and lower bounds,Signal-to-noise ratio,Speech recognition,Local statistics,Artificial intelligence,Thresholding,Mathematics
Conference
ISBN
Citations 
PageRank 
978-1-5090-2008-9
1
0.36
References 
Authors
11
4
Name
Order
Citations
PageRank
Xiaofei Li110324.78
Radu Horaud22776261.99
Laurent Girin347353.76
Sharon Gannot41754130.51