Title
Frequency-domain spectral envelope estimation for low rate coding of speech
Abstract
Estimation of the spectral envelope in the frequency domain allows to avoid some problems of linear prediction (LP) algorithms for voiced speech. We present a low complexity method of spectral envelope estimation from harmonics for low rate coding. The method consists in computing the harmonic amplitude spectrum using pitch-synchronous DFT with length depending on voicing, modifying this spectrum outside the telephone bandwidth to simplify modeling of the useful bandwidth and interpolating it by a frequency-domain low-pass filter. An all-pole model is then fitted to this modified smoothed version of the harmonic spectrum. The method was implemented on the harmonic-stochastic excitation (HSX) vocoder and the performance was compared with the LP algorithm similar to that used in the G.729 speech coding standard. A-B comparative tests show an important increase in perceptual quality.
Year
DOI
Venue
1999
10.1109/ICASSP.1999.758110
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference
Keywords
Field
DocType
computational complexity,discrete Fourier transforms,frequency-domain analysis,harmonic analysis,interpolation,low-pass filters,spectral analysis,speech coding,transform coding,G.729 speech coding standard,HSX vocoder,all-pole mode,frequency-domain low-pass filter,frequency-domain spectral envelope estimation,harmonic amplitude spectrum,harmonic-stochastic excitation vocoder,interpolation,low complexity method,low rate coding,modified smoothed version,perceptual quality,performance,pitch-synchronous DFT,useful bandwidth,voiced speech
Frequency domain,Spectral envelope,Speech coding,Computer science,Harmonic,Linear prediction,Speech recognition,Bandwidth (signal processing),Harmonic Vector Excitation Coding,Harmonic spectrum
Conference
Volume
ISSN
ISBN
1
1520-6149
0-7803-5041-3
Citations 
PageRank 
References 
6
1.34
8
Authors
2
Name
Order
Citations
PageRank
Jelinek, M.161.34
J. Adoul229063.42