Abstract | ||
---|---|---|
Detecting when voice is or is not present is an outstanding problem for speech transmission, enhancement and recognition. Here we present a novel multichannel source activity detector that exploits the spatial localization of the target audio source. The detector uses an array signal processing technique to maximize the signal-to-interference ratio for the target source thus decreasing the activity detection error rate. We compare our two-channel voice activity detector (VAD) with the AMR voice detection algorithms on real data recorded in a noisy car environment. The new algorithm shows improvements in error rates of 55-70% compared to the state-of-the-art adaptive multi-rate algorithm AMR2 used in present voice transmission technology. |
Year | Venue | DocType |
---|---|---|
2002 | European Signal Processing Conference | Conference |
ISSN | Citations | PageRank |
2219-5491 | 2 | 0.56 |
References | Authors | |
1 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Justinian Rosca | 1 | 344 | 45.50 |
Radu Balan | 2 | 59 | 10.98 |
N. P. Fan | 3 | 2 | 0.56 |
Christophe Beaugeant | 4 | 148 | 20.60 |
Virginie Gilg | 5 | 2 | 0.90 |