Title
Harmonic-stochastic excitation (HSX) speech coding below 4 kbit/s
Abstract
This paper presents an algorithm for encoding speech signals at bit rates below 4 kbit/s based on mixed harmonic and stochastic modeling of the excitation signal. The algorithm uses robust pitch tracking and efficient voicing analysis to determine the ratio of the harmonic component to the stochastic component. The harmonic component is synthesized using a bank of bandpass filters, while the stochastic component is synthesized using an inverse STFT with overlap-and-add. Postfiltering is applied at the decoder to enhance the quality of the synthesized speech. A 2.4 kbit/s version of the algorithm was formally tested, and the DAM and DRT scores showed that the coder's performance is comparable to that of the DoD 4.8 kbit/s Federal Standard FS-1016.
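The mixed excitation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the harmonic part is built here as a plain sum of sinusoids at multiples of the pitch (a stand-in for the paper's bandpass filter bank), the stochastic part uses a flat-magnitude, random-phase spectrum, and a single voicing ratio is applied to the whole frame. The 8 kHz sampling rate, the frame length, and all function names are assumptions chosen for illustration.

```python
import cmath
import math
import random


def harmonic_component(f0_hz, n_samples, fs_hz=8000.0):
    """Sum of equal-amplitude sinusoids at multiples of f0_hz, below Nyquist.

    Assumption: a sinusoidal sum stands in for the bandpass filter bank.
    """
    n_harm = int((fs_hz / 2 - 1e-9) // f0_hz)
    return [
        sum(math.sin(2 * math.pi * k * f0_hz * n / fs_hz)
            for k in range(1, n_harm + 1)) / n_harm
        for n in range(n_samples)
    ]


def inverse_dft_real(spectrum):
    """Naive O(N^2) inverse DFT; returns the real part of each sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
            for k in range(n)).real / n
        for t in range(n)
    ]


def stochastic_component(n_samples, frame_len=64, rng=None):
    """Noise synthesized frame by frame with inverse DFT and overlap-add."""
    rng = rng or random.Random(0)
    hop = frame_len // 2
    # Periodic Hann window: shifted copies sum to 1 at 50% overlap.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_len)
              for i in range(frame_len)]
    out = [0.0] * (n_samples + frame_len)
    for start in range(0, n_samples, hop):
        # Flat magnitude, random phase, conjugate symmetry -> real frame.
        spec = [0j] * frame_len
        for k in range(1, frame_len // 2):
            spec[k] = cmath.exp(1j * rng.uniform(0.0, 2 * math.pi))
            spec[frame_len - k] = spec[k].conjugate()
        frame = inverse_dft_real(spec)
        for i in range(frame_len):
            out[start + i] += window[i] * frame[i]
    return out[:n_samples]


def mix_excitation(harmonic, stochastic, voicing):
    """Blend the two components; voicing=1 is fully harmonic, 0 fully noise."""
    if not 0.0 <= voicing <= 1.0:
        raise ValueError("voicing must lie in [0, 1]")
    return [voicing * h + (1.0 - voicing) * s
            for h, s in zip(harmonic, stochastic)]


if __name__ == "__main__":
    h = harmonic_component(100.0, 160)   # 20 ms at the assumed 8 kHz rate
    s = stochastic_component(160)
    excitation = mix_excitation(h, s, voicing=0.7)
    print(len(excitation))
```

In a real coder the voicing decision would more likely vary across frequency and time, and the resulting excitation would drive an LPC synthesis filter followed by the postfilter; both steps are omitted from this sketch.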
Year: 1996
DOI: 10.1109/ICASSP.1996.540326
Venue: Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference
Keywords: Fourier transforms, band-pass filters, harmonic analysis, linear predictive coding, speech coding, speech processing, speech synthesis, stochastic processes, 2.4 kbit/s, 4 kbit/s, DAM score, DRT score, HSX algorithm, bandpass filters, efficient voicing analysis, harmonic-stochastic excitation, inverse STFT, low bit rate, overlap-and-add, postfiltering, robust pitch tracking, speech coding, synthesized speech quality
Field: Speech processing, Speech synthesis, Speech coding, Algorithm design, Computer science, Harmonic, Speech recognition, Harmonic analysis, Harmonic Vector Excitation Coding, Linear predictive coding
DocType: Conference
Volume: 1
ISSN: 1520-6149
ISBN: 0-7803-3192-3
Citations: 7
PageRank: 0.82
References: 7
Authors: 4
Name | Order | Citations | PageRank
Laflamme, C. | 1 | 7 | 0.82
Salami, R. | 2 | 7 | 0.82
Matmti, R. | 3 | 7 | 0.82
J. Adoul | 4 | 290 | 63.42