Abstract | ||
---|---|---|
Speech coders operating in the time domain can be extended with a frequency-domain mode to improve the encoding of music, although this is challenging at low delay. In such a scenario, the short analysis window limits the benefit of the transform coder, while a delayless switch between the two coders constrains the system further. The paper presents an LPC- and MDCT-based audio coder, part of the new 3GPP codec for Enhanced Voice Services, which aims to solve these issues. Several advanced coding tools are introduced to alleviate the constraints: transient handling is improved, harmonic structures are better preserved, and the modeling of the zero-quantized frequencies is enhanced. Test results show that the resulting low-delay switched coder brings a clear improvement over a speech coder and is competitive even with audio coders of higher delay. |
Year | Venue | Keywords |
---|---|---|
2015 | 2015 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) | Audio coding, Low delay, LPC, MDCT, EVS |
Field | DocType | ISSN |
Frequency domain, Speech coding, Adaptive Multi-Rate audio codec, Computer science, Speech recognition, Coding (social sciences), Sub-band coding, Decoding methods, Codec, Encoding (memory) | Conference | 1520-6149 |
Citations | PageRank | References |
9 | 0.96 | 7 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Guillaume Fuchs | 1 | 38 | 7.84 |
Christian R. Helmrich | 2 | 30 | 4.87 |
Goran Markovic | 3 | 11 | 2.42 |
Matthias Neusinger | 4 | 9 | 0.96 |
Emmanuel Ravelli | 5 | 38 | 4.70 |
Takehiro Moriya | 6 | 89 | 24.08 |