Abstract |
---|
Discontinuous Transmission (DTX) is an efficient way to drastically reduce the transmission rate of a communication codec in the absence of voice input. In this mode, most frames that are determined to consist of background noise only are dropped from transmission and replaced by Comfort Noise Generation (CNG) at the decoder. In this paper, we propose a novel CNG approach that combines information about the actual background noise gathered at both the encoder and the decoder. It better reproduces background noise types with a pronounced spectral tilt, which are difficult for traditional schemes based on a linear prediction model. The proposed technique operates in the frequency domain. It is part of the Enhanced Voice Services (EVS) codec, where it is known as FD-CNG. Listening tests show the superior quality of FD-CNG over existing approaches for certain background noise types, such as car noise. |
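
The DTX/CNG idea described in the abstract can be sketched roughly as follows. This is a toy illustration only, not the FD-CNG algorithm of the EVS codec, whose details this record does not give. The function names, the `sid_interval` parameter, the per-band energy values, and the use of periodic silence descriptor (SID) updates are all assumptions made for the example. The first function labels frames as transmitted speech, a noise-parameter update, or dropped; the second shapes a random-phase spectrum with per-band noise energies in the frequency domain, which is one simple way to reproduce a tilted noise spectrum.

```python
import cmath
import math
import random


def dtx_schedule(vad_flags, sid_interval=8):
    """Label each frame as SPEECH, SID (noise update) or NO_DATA (dropped).

    vad_flags: iterable of booleans, True when the frame contains voice.
    sid_interval: hypothetical spacing (in frames) between noise updates.
    """
    labels = []
    since_sid = None  # frames since the last SID in the current noise stretch
    for active in vad_flags:
        if active:
            labels.append("SPEECH")
            since_sid = None
        elif since_sid is None or since_sid + 1 >= sid_interval:
            labels.append("SID")
            since_sid = 0
        else:
            labels.append("NO_DATA")
            since_sid += 1
    return labels


def synthesize_comfort_noise(band_energies, bins_per_band, rng):
    """Return one real-valued noise frame shaped by per-band energies.

    Each frequency bin gets magnitude sqrt(energy of its band) and a random
    phase; DC and Nyquist bins are left at zero.
    """
    half = []
    for energy in band_energies:
        amplitude = math.sqrt(energy)
        for _ in range(bins_per_band):
            phase = rng.uniform(0.0, 2.0 * math.pi)
            half.append(cmath.rect(amplitude, phase))
    n = 2 * (len(half) + 1)  # frame length in samples
    # Hermitian-symmetric spectrum so that the inverse DFT is real.
    spectrum = [0j] + half + [0j] + [z.conjugate() for z in reversed(half)]
    frame = []
    for t in range(n):  # naive O(n^2) inverse DFT, fine for a short frame
        acc = sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                  for k in range(n))
        frame.append(acc.real / n)
    return frame


if __name__ == "__main__":
    print(dtx_schedule([True, False, False, False, False, True], sid_interval=3))
    # Low-frequency-heavy energies, loosely mimicking a tilted spectrum.
    noise = synthesize_comfort_noise([4.0, 1.0, 0.25], 5, random.Random(0))
    print(len(noise), "samples")
```

In a real codec the band energies would come from a noise estimator rather than fixed constants; the abstract states that FD-CNG combines such information from both the encoder and the decoder side.
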
Year | Venue | Keywords |
---|---|---|
2015 | 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) | speech coding, audio coding, CNG, DTX, EVS |
Field | DocType | ISSN |
---|---|---|
Background noise, Noise (signal processing), Noise measurement, Computer science, Electronic engineering, Artificial intelligence, Codec, Frequency domain, Pattern recognition, Adaptive Multi-Rate audio codec, Speech recognition, Discontinuous transmission, Encoder | Conference | 1520-6149 |
Citations | PageRank | References |
---|---|---|
2 | 0.48 | 4 |
Authors |
---|
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Anthony Lombard | 1 | 51 | 7.68 |
Stephan Wilde | 2 | 7 | 1.14 |
Emmanuel Ravelli | 3 | 6 | 1.29 |
Stefan Döhla | 4 | 3 | 0.84 |
Guillaume Fuchs | 5 | 38 | 7.84 |
Martin Dietz | 6 | 2 | 0.82 |