Abstract | ||
---|---|---|
Speech coders provide high speech quality at low rates. However they perform poorly when encoding non-speech signals. This paper proposes a new enhancement algorithm requiring minimum side information to reduce the effect of this shortcoming. The enhancement algorithm consists of post-processing the speech decoder output in the spectral domain. Specifically, some frequency components are reduced or forced to zero when the corresponding frequency content is poorly described by the speech coder. The choice of modifying spectral components is determined at the encoder. thus requiring to transmit the decision information. Experiments combining the AMR-WB speech codec and the proposed audio enhancement show that the quality for music signals is improved significantly while not affecting the quality for speech inputs. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1109/ICASSP.2005.1416038 | 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING |
Keywords | Field | DocType |
speech synthesis,frequency,multiple signal classification,speech coding,decoding,encoding | Speech enhancement,Speech processing,Speech coding,Pattern recognition,Voice activity detection,Computer science,Adaptive Multi-Rate audio codec,PSQM,Speech recognition,Artificial intelligence,Codec2,Linear predictive coding | Conference |
ISSN | Citations | PageRank |
1520-6149 | 1 | 0.40 |
References | Authors | |
4 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Guillaume Fuchs | 1 | 38 | 7.84 |
Lefebvre, R. | 2 | 93 | 18.55 |