Title | ||
---|---|---|
A scheme for pitch extraction of speech using autocorrelation function with frame length proportional to the time lag |
Abstract | ||
---|---|---|
Although pitch extraction schemes based on the short-time autocorrelation function give reliable results for most of the speech signal, the autocorrelation peaks that indicate the fundamental period fluctuate with frame position, causing occasional pitch extraction errors. To reduce these errors, which stem from inappropriate analysis frame length and position, a scheme is proposed that uses a new definition of the normalized short-time autocorrelation function. A major advantage of this definition over conventional ones is that the frame length changes in proportion to the time lag, so the input speech can be analyzed without any knowledge of the speaker's fundamental frequency range. Two methods are proposed for the normalization, and the one that compensates for variations in the short-time power of the waveform is shown to give better results. A pitch extraction system is constructed on a workstation, and the validity of the proposed scheme is demonstrated by experiments using connected speech from male and female announcers. |
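
The idea in the abstract, an autocorrelation whose frame length grows in proportion to the time lag, can be illustrated with a minimal sketch. The abstract does not give the paper's formula, so everything here is an assumption for illustration: the function names, the `frames_per_lag` proportionality constant, the `min_lag`/`max_lag` search bounds, and the particular normalization (dividing by the energies of both the frame and its lagged copy, as one plausible way to compensate for short-time power variation).

```python
import math

def lag_proportional_autocorr(x, start, lag, frames_per_lag=2):
    """Normalized autocorrelation at `lag`, computed over a frame whose
    length is frames_per_lag * lag samples (hypothetical parameter)."""
    n = frames_per_lag * lag  # frame length proportional to the time lag
    if start + n + lag > len(x):
        raise ValueError("signal too short for this lag")
    a = x[start:start + n]              # reference frame
    b = x[start + lag:start + lag + n]  # same-length frame, delayed by `lag`
    num = sum(p * q for p, q in zip(a, b))
    # Assumed normalization: divide by the energies of both frames so that
    # a change in short-time power between them does not bias the value.
    den = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return num / den if den > 0 else 0.0

def estimate_period(x, start, min_lag, max_lag, frames_per_lag=2):
    """Return (lag, value) for the lag with the largest normalized
    autocorrelation in [min_lag, max_lag]; the search bounds are
    illustrative and not part of the paper's scheme."""
    best_lag, best_val = None, -1.0
    for lag in range(min_lag, max_lag + 1):
        if start + (frames_per_lag + 1) * lag > len(x):
            break  # not enough samples left for longer lags
        val = lag_proportional_autocorr(x, start, lag, frames_per_lag)
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag, best_val
```
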
Year | DOI | Venue |
---|---|---|
1992 | 10.1109/ICASSP.1992.225950 | ICASSP '92: Proceedings of the 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing - Volume 1 |
Keywords | DocType | Volume |
---|---|---|
correlation theory, speech analysis and processing, connected speech, error reduction, frame length, frame position, fundamental period, input speech, normalization, pitch extraction errors, short-time autocorrelation function, short-time power, speech processing, speech signal, time lag, waveform, workstation | Conference | 1 |
ISBN | Citations | PageRank |
---|---|---|
0-7803-0532-9 | 9 | 1.61 |
References | Authors |
---|---|
1 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Keikichi Hirose | 1 | 714 | 175.38 |
Hiroya Fujisaki | 2 | 259 | 113.38 |
Shigenobu Seto | 3 | 25 | 7.52 |