Abstract | ||
---|---|---|
A scalable codec has been constructed by using transform coding and the basic modules for scalable encoder and decoder. It allows users to choose a variety of scalable configurations in the frequency domain. The basic module is a quantizer that can quantize MDCT (modified DCT) coefficients transformed from a variety of frequency regions. This module mainly works at bit rates of more than 8 kbit/s. We can also change the target frequency regions of the basic module's input-output signals in each transform frame; i.e., we can change the scalable structure according to the nature of the input signals. In the scalable codec described here, the input-output signals are monaural and the sampling frequency is 24 kHz. The total bit rate of this scalable codec is more than 8 kbit/s. Subjective quality evaluation tests, mainly for musical sound sources, showed that its sound quality is better than that of an MPEG-2 layer 3 codec at 8, 16, and 24 kbit/s when our scalable codec is constructed of 8-kbit/s basic modules. In combination with AAC (advanced audio coding), our scalable codec will be chosen as an international standard in ISO/IEC-MPEG-4/Audio. |
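
The layered structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper's basic module uses vector quantization, whereas a uniform scalar quantizer stands in for it here, and the function names (`mdct`, `encode_layers`, `decode_layers`), frame size, frequency regions, and step sizes are all hypothetical. Each layer quantizes the residual left by the earlier layers within its own region of MDCT coefficients, so a decoder can stop after any number of layers.

```python
import math


def mdct(frame):
    """Direct O(N^2) MDCT of a 2N-sample frame; returns N coefficients."""
    n2 = len(frame)
    n = n2 // 2
    return [
        sum(frame[t] * math.cos(math.pi / n * (t + 0.5 + n / 2) * (k + 0.5))
            for t in range(n2))
        for k in range(n)
    ]


def encode_layers(coeffs, layers):
    """Quantize coefficients layer by layer.

    Each layer is (start, stop, step): it quantizes the residual in
    coeffs[start:stop] with a uniform step size (a stand-in for the
    vector quantizer of the paper's basic module).
    """
    residual = list(coeffs)
    encoded = []
    for start, stop, step in layers:
        indices = [round(residual[k] / step) for k in range(start, stop)]
        for offset, q in enumerate(indices):
            residual[start + offset] -= q * step  # pass the remainder on
        encoded.append((start, step, indices))
    return encoded


def decode_layers(encoded, num_coeffs, num_layers):
    """Reconstruct coefficients from the first num_layers layers only."""
    out = [0.0] * num_coeffs
    for start, step, indices in encoded[:num_layers]:
        for offset, q in enumerate(indices):
            out[start + offset] += q * step
    return out


if __name__ == "__main__":
    frame = [math.sin(0.3 * t) for t in range(16)]  # toy 16-sample frame
    coeffs = mdct(frame)
    # Hypothetical configuration: coarse low band, then full band, then high band.
    layers = [(0, 4, 0.5), (0, 8, 0.25), (4, 8, 0.1)]
    encoded = encode_layers(coeffs, layers)
    for used in range(len(layers) + 1):
        decoded = decode_layers(encoded, len(coeffs), used)
        err = sum((c - d) ** 2 for c, d in zip(coeffs, decoded))
        print(f"layers decoded: {used}, squared error: {err:.6f}")
```

Because each layer rounds the current residual to the nearest multiple of its step, the reconstruction error cannot grow as more layers are decoded, which is the property that makes such a bitstream scalable. Changing the `(start, stop)` pairs per frame corresponds loosely to the abstract's point about adapting the target frequency regions to the input signal.
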
Year | DOI | Venue |
---|---|---|
1999 | 10.1109/ICASSP.1999.759816 | Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference |
Keywords | Field | DocType |
audio coding,code standards,codecs,discrete cosine transforms,telecommunication standards,transform coding,vector quantisation,16 kbit/s,24 kHz,24 kbit/s,8 kbit/s,ISO/IEC-MPEG-4/Audio,MDCT coefficients,MPEG-2 layer 3 codec,VQ,advanced audio coding,bit rates,frequency domain,frequency regions,input-output signals,international standard,modified DCT,musical sound sources,quantizer units,sampling frequency,scalable audio coder,scalable codec,scalable decoder,scalable encoder,sound quality,subjective quality evaluation tests,transform coding,transform frame | Computer science,Discrete cosine transform,Sound quality,Artificial intelligence,Computer hardware,Codec,Frequency domain,Pattern recognition,Adaptive Multi-Rate audio codec,Transform coding,Speech recognition,Advanced Audio Coding,Encoder | Conference |
Volume | ISSN | ISBN |
2 | 1520-6149 | 0-7803-5041-3 |
Citations | PageRank | References |
9 | 1.88 | 3 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Akio Jin | 1 | 17 | 4.01 |
Moriya, T. | 2 | 62 | 8.44 |
Norimatsu, T. | 3 | 9 | 2.22 |
Tsushima, M. | 4 | 9 | 1.88 |