Abstract | ||
---|---|---|
This paper proposes a new model-based method for transform coding of audio signals. The input signal is mapped in "perceptual" domain by linear-predictive weighting filter followed by modified discrete cosine transform (MDCT). To provide bitstream scalability, model-based bit plane coding is then applied with respect to the mean square error (MSE) criterion. We present methods to estimate the symbol probability in bit planes assuming a generalized Gaussian model for the distribution of MDCT coefficients. We compare the performance of the proposed bitstream scalable coder with stack-run coding and ITU-T G.722.1. Objective and subjective quality results are presented. The proposed coder is equivalent to or slightly worse than reference coders, but presents the nice advantage of being scalable. Performance penalty due to bitstream scalability is evident at low bitrates. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1109/ICASSP.2008.4518534 | ICASSP |
Keywords | Field | DocType |
modified discrete cosine transform,bitstream scalable coder,audio signals,audio coding,transform coding,symbol probability,gaussian distribution,mean square error criterion,discrete cosine transforms,model-based bit plane coding,linear-predictive weighting filter,bitstream scalability,itu-t g.722.1,generalized gaussian model,mean square error methods,embedded transform coding,stack-run coding,mean square error | Audio signal,Weighting filter,Pattern recognition,Modified discrete cosine transform,Computer science,Transform coding,Coding (social sciences),Sub-band coding,Artificial intelligence,Harmonic Vector Excitation Coding,Bitstream | Conference |
ISSN | ISBN | Citations |
1520-6149 E-ISBN : 978-1-4244-1484-0 | 978-1-4244-1484-0 | 1 |
PageRank | References | Authors |
0.43 | 6 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Thi Minh Nguyet Hoang | 1 | 1 | 1.11 |
marie oger | 2 | 12 | 2.82 |
Stéphane Ragot | 3 | 73 | 9.92 |
Marc Antonini | 4 | 1 | 0.77 |