Title | ||
---|---|---|
Dual-Mode Avq Coding Based On Spectral Masking And Sparseness Detection For Itu-T G.711.1/G.722 Super-Wideband Extensions |
Abstract | ||
---|---|---|
ITU-T Recommendations G.711.1 Annex D and G.722 Annex B, which are super-wideband (50-14,000 Hz) extensions to G.711.1 and G.722, have been recently standardized. This paper introduces a new coding method proposed and employed in the above ITU-T standards. The proposed coding method employs an adaptive spectral masking of the algebraic vector quantization (AVQ) for MDCT-domain non-sparse signals. The adaptive spectral masking is switched on and off based on MDCT-domain sparseness analysis. When the target MDCT coefficients are categorized as non-sparse, masking level of the target MDCT coefficients is adaptively controlled using spectral envelope information. The performance of the proposed method as a part of the ITU-T G.711.1 Annex D is evaluated in comparison with the ordinary AVQ. Subjective listening test results show that the proposed method improves the sound quality more than 0.1 points with a five grade scale in average of speech, music and mixed content, and the significance of the improvement is validated. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | speech and audio coding, Standardization, ITU-T G.711.1 Annex D, ITU-T G.722 Annex B, super-wideband extension, algebraic vector quantization |
Field | DocType | Citations |
Wideband,Masking (art),Pattern recognition,Computer science,Speech recognition,Coding (social sciences),Artificial intelligence,G.722 | Conference | 0 |
PageRank | References | Authors |
0.34 | 1 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Masahiro Fukui | 1 | 42 | 14.57 |
Shigeaki Sasaki | 2 | 11 | 2.54 |
Yusuke Hiwasaki | 3 | 52 | 11.63 |
Sachiko Kurihara | 4 | 15 | 3.38 |
Yoichi Haneda | 5 | 97 | 20.16 |