Title |
---|
Voice activity detection based on statistical likelihood ratio with adaptive thresholding |
Abstract |
---|
The statistical likelihood ratio test is a widely used voice activity detection (VAD) method, in which the likelihood ratio of the current temporal frame is compared with a threshold. A fixed threshold is usually used, but a single value is not suitable for all types of noise. In this paper, an adaptive threshold is proposed as a function of the local statistics of the likelihood ratio. This threshold represents the upper bound of the likelihood ratio for the non-speech frames, whereas it remains generally lower than the likelihood ratio for the speech frames. As a result, a high non-speech hit rate can be achieved while keeping the speech hit rate as high as possible. |
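
The idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the threshold function, so the rule used here (mean plus `k` standard deviations of the log likelihood ratio over recent frames labelled non-speech) is an assumption made for illustration, not the authors' formula. The names `adaptive_lrt_vad`, `hit_rates`, `window`, `k`, and `init_frames` are hypothetical, and the per-frame log likelihood ratios are assumed to be computed elsewhere.

```python
from collections import deque
from statistics import mean, pstdev


def adaptive_lrt_vad(log_lr, window=20, k=3.0, init_frames=5):
    """Label each frame as speech (True) or non-speech (False).

    Assumed rule (not taken from the paper): the threshold is
    mean + k * std of the log likelihood ratio over the most recent
    frames that were themselves labelled non-speech.
    Requires init_frames >= 1 so the statistics are never empty.
    """
    noise_hist = deque(maxlen=window)  # recent non-speech LR values
    decisions, thresholds = [], []
    for t, value in enumerate(log_lr):
        if t < init_frames:
            # Assumption: the first few frames contain noise only.
            noise_hist.append(value)
            decisions.append(False)
            thresholds.append(float("inf"))
            continue
        # Local statistics of the likelihood ratio give the threshold.
        threshold = mean(noise_hist) + k * pstdev(noise_hist)
        is_speech = value > threshold
        if not is_speech:
            noise_hist.append(value)  # only noise frames update the statistics
        decisions.append(is_speech)
        thresholds.append(threshold)
    return decisions, thresholds


def hit_rates(decisions, labels):
    """Return (speech hit rate, non-speech hit rate) against ground truth.

    Assumes labels contain at least one speech and one non-speech frame.
    """
    speech = [d for d, is_sp in zip(decisions, labels) if is_sp]
    noise = [d for d, is_sp in zip(decisions, labels) if not is_sp]
    speech_hr = sum(speech) / len(speech)
    nonspeech_hr = sum(not d for d in noise) / len(noise)
    return speech_hr, nonspeech_hr
```

Because the threshold follows the recent non-speech values, it acts as an approximate upper bound on the likelihood ratio of noise frames, as the abstract describes. A larger `k` in this sketch favours the non-speech hit rate at the expense of the speech hit rate.
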
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/IWAENC.2016.7602911 | 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC) |
Keywords | Field | DocType |
---|---|---|
voice activity detection, likelihood ratio test, adaptive threshold, high non-speech hit rate | Hit rate, Likelihood-ratio test, Pattern recognition, Noise measurement, Voice activity detection, Upper and lower bounds, Signal-to-noise ratio, Speech recognition, Local statistics, Artificial intelligence, Thresholding, Mathematics | Conference |
ISBN | Citations | PageRank |
---|---|---|
978-1-5090-2008-9 | 1 | 0.36 |
References | Authors |
---|---|
11 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiaofei Li | 1 | 103 | 24.78 |
Radu Horaud | 2 | 2776 | 261.99 |
Laurent Girin | 3 | 473 | 53.76 |
Sharon Gannot | 4 | 1754 | 130.51 |