Abstract |
---|
A novel concept for perceptual audio coding is presented, based on combining a pre- and post-filter, controlled by a psychoacoustic model, with a transform coding scheme. This paradigm allows the temporal and spectral shape of the masked threshold to be modeled with a resolution independent of the transform used. With frequency warping techniques, the maximum possible detail for a given filter order can be made frequency-dependent and thus better adapted to the human auditory system. The filter coefficients are represented efficiently by line spectral frequency (LSF) parameters, which can be adaptively interpolated over time. First experiments with a system obtained by extending an existing transform codec showed that this approach can significantly improve performance for speech signals, while performance for other signals remained the same. |
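
The frequency warping mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common first-order all-pass substitution, in which each unit delay is replaced by D(z) = (z^-1 - lam) / (1 - lam * z^-1); the abstract does not say which warping the authors use, and the function name `warp_frequency` and the coefficient value below are hypothetical, chosen only for illustration.

```python
import math


def warp_frequency(omega, lam):
    """Map a frequency omega (radians/sample, 0..pi) to its warped value.

    Assumes a first-order all-pass warping with coefficient lam (|lam| < 1).
    This is an illustrative assumption, not the method confirmed by the paper.
    """
    if not -1.0 < lam < 1.0:
        raise ValueError("lam must lie strictly between -1 and 1")
    # Negative phase of the all-pass section; the denominator is always
    # positive for |lam| < 1, so atan2 behaves like a plain arctangent here.
    return omega + 2.0 * math.atan2(lam * math.sin(omega),
                                    1.0 - lam * math.cos(omega))


if __name__ == "__main__":
    lam = 0.5  # hypothetical coefficient, not taken from the paper
    for k in range(5):
        omega = k * math.pi / 4
        print(f"omega = {omega:.3f} -> warped = {warp_frequency(omega, lam):.3f}")
```

With a positive `lam`, low frequencies are stretched over a larger part of the warped axis, so a filter of fixed order designed on that axis can resolve more detail at low frequencies, which is the kind of frequency-dependent resolution the abstract describes.
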
Year | DOI | Venue |
---|---|---|
2000 | 10.1109/ICASSP.2000.859101 | ICASSP |
Keywords | Field | DocType |
psychoacoustic pre,coding scheme,human auditory system,filter order,spectral shape,lsf parameter,filter coefficient,psychoacoustic model,maximum possible detail,novel concept,perceptual audio coding,filter coefficients,transform coding,psychology,codecs,interpolation,resolution,performance,frequency,adaptive filters,modeling,speech coding | Image warping,Speech coding,Pattern recognition,Computer science,Transform coding,Coding (social sciences),Speech recognition,Artificial intelligence,Sub-band coding,Adaptive filter,Codec,Filter design | Conference |
ISSN | ISBN | Citations |
1520-6149 | 0-7803-6293-4 | 18 |
PageRank | References | Authors |
3.19 | 2 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
B. Edler | 1 | 46 | 9.21 |
G. Schuller | 2 | 32 | 5.32 |