Abstract | ||
---|---|---|
We present a novel paradigm for sparse audio signal coding. After annihilating imperceptible components in some transform domain, the encoder buffers the resulting sparse vector into small non-overlapping frames. In each frame, the active elements' amplitudes are quantized, and with the help of a priori known unquantized “filler” vectors (whose values are placed in inactive positions), their positions are encoded such that a certain function f of the filled vector is nearly integer-valued. For this purpose, the quantized values of the sparse frames are pre-adjusted in a controlled manner with distortion in mind (hence the name “targeted dithering”). The decoder then steps through the possible combinations of positions for the nonzero elements and checks, using the filler vector, which of these combinations produces an integer-valued f, thereby retrieving the active elements' positions. In preliminary tests, good quality can be obtained by encoding 44.1 kHz signals at less than 50 kbps. |
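
The position-recovery idea in the abstract can be illustrated with a small toy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function `f` is taken to be a scaled weighted sum, the weights, filler values, `SCALE` and `TOL` are arbitrary, only the first amplitude is dithered, and the adjusted amplitudes are passed to the decoder at full float precision, which a real codec would not do. Active positions are assumed to be listed in ascending order.

```python
from itertools import combinations
from math import sqrt

# Hypothetical toy setup: none of these values come from the paper.
N = 8                                   # frame length
WEIGHTS = [sqrt(p) for p in (2, 3, 5, 7, 11, 13, 17, 19)]
FILLER = [0.11, -0.07, 0.05, 0.13, -0.09, 0.03, -0.12, 0.08]
SCALE = 64.0                            # larger scale -> smaller dither
TOL = 1e-9                              # "nearly integer" threshold


def f(vector):
    """Assumed test function: a scaled weighted sum of the filled vector."""
    return SCALE * sum(w * x for w, x in zip(WEIGHTS, vector))


def fill(positions, amplitudes):
    """Place amplitudes at the active positions, filler values elsewhere."""
    vector = list(FILLER)
    for pos, amp in zip(positions, amplitudes):
        vector[pos] = amp
    return vector


def encode(positions, amplitudes):
    """Nudge the first amplitude so that f(filled vector) is an integer."""
    s = f(fill(positions, amplitudes))
    delta = (round(s) - s) / (SCALE * WEIGHTS[positions[0]])
    return [amplitudes[0] + delta] + list(amplitudes[1:])


def decode(amplitudes):
    """Try every position combination; keep those giving a near-integer f."""
    matches = []
    for candidate in combinations(range(N), len(amplitudes)):
        value = f(fill(candidate, amplitudes))
        if abs(value - round(value)) < TOL:
            matches.append(candidate)
    return matches


true_positions = (1, 4, 6)              # ascending order, as decode() assumes
sent = encode(true_positions, [1.5, -2.25, 0.75])
recovered = decode(sent)                # list of candidate position tuples
```

In this sketch the decoder never receives the positions; it only tests which placement of the received amplitudes makes `f` land on an integer. Nothing in the toy rules out a second combination passing the test by chance, so `decode` returns every match rather than assuming uniqueness.
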
Year | Venue | Keywords |
---|---|---|
2010 | Aalborg | audio coding,combinatorial mathematics,combinatorial decoding,frequency 44.1 kHz,sparse audio signal coding,targeted dithering,optimization,decoding,encoding |
Field | DocType | ISSN |
---|---|---|
Integer,Audio signal,A priori and a posteriori,Algorithm,Speech recognition,Encoder,Dither,Decoding methods,Distortion,Mathematics,Encoding (memory) | Conference | 2219-5491 |
Citations | PageRank | References |
---|---|---|
0 | 0.34 | 2 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Frédéric Mustière | 1 | 6 | 1.19 |
Hossein Najaf-Zadeh | 2 | 22 | 4.37 |
Ramin Pichevar | 3 | 56 | 9.92 |
Hassan Lahdili | 4 | 17 | 2.06 |
Louis Thibault | 5 | 20 | 3.21 |
Martin Bouchard | 6 | 172 | 29.67 |