Abstract | ||
---|---|---|
In this paper we propose a model-based approach to instantaneous pitch estimation in noisy speech that incorporates pitch smoothness assumptions into the well-known harmonic model. The latent pitch contour is modeled using a basis of smooth polynomials and is fit to waveform data through a harmonic model whose partials have time-varying amplitudes. The resulting nonlinear least-squares estimation task is solved with the Gauss-Newton method, using a novel initialization step that greatly increases algorithm efficiency. We demonstrate the accuracy and robustness of our method through comparisons with state-of-the-art pitch estimation algorithms on both simulated and real waveform data. |
Year | Venue | Keywords |
---|---|---|
2009 | INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | Harmonic model, instantaneous pitch estimation |
Field | DocType | Citations |
Pattern recognition,Computer science,Speech recognition,Artificial intelligence | Conference | 1 |
PageRank | References | Authors |
0.47 | 5 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jung Ook Hong | 1 | 2 | 0.84 |
Patrick J. Wolfe | 2 | 355 | 40.15 |
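
The estimation scheme summarized in the abstract (a polynomial pitch contour fit through a harmonic model by Gauss-Newton) can be illustrated with a minimal sketch. This is not the authors' implementation, and it makes several assumptions not found in the record above: the partial amplitudes are held constant over the frame (the paper lets them vary in time), the amplitudes are eliminated by linear least squares at every iteration, the Jacobian is approximated by forward differences, and the starting point comes from a coarse grid search over constant pitch values, which stands in for the paper's own initialization step. The names `harmonic_basis`, `residual`, and `fit_pitch` are hypothetical.

```python
import numpy as np


def harmonic_basis(coeffs, tau, dur, n_harm):
    """Cos/sin columns for the harmonics of a polynomial pitch contour.

    coeffs[p] multiplies tau**p (in Hz), where tau = t / dur lies in [0, 1).
    """
    # Fundamental phase: 2*pi times the time integral of the pitch polynomial.
    integral = sum(c / (p + 1) * tau ** (p + 1) for p, c in enumerate(coeffs))
    phase = 2.0 * np.pi * dur * integral
    k = np.arange(1, n_harm + 1)
    return np.hstack([np.cos(np.outer(phase, k)), np.sin(np.outer(phase, k))])


def residual(coeffs, x, tau, dur, n_harm):
    """Residual after solving for constant partial amplitudes by least squares."""
    basis = harmonic_basis(coeffs, tau, dur, n_harm)
    amps, *_ = np.linalg.lstsq(basis, x, rcond=None)
    return x - basis @ amps


def fit_pitch(x, fs, n_harm=3, order=1, grid=None, n_iter=30, eps=1e-3):
    """Estimate pitch-polynomial coefficients (Hz) for one frame of samples x."""
    n = len(x)
    tau, dur = np.arange(n) / n, n / fs

    def res(c):
        return residual(c, x, tau, dur, n_harm)

    if grid is None:
        grid = np.arange(80.0, 400.0, 1.0)  # assumed search range in Hz
    # Assumed initialization: the best constant pitch on a coarse grid.
    coeffs = np.zeros(order + 1)
    coeffs[0] = min(grid, key=lambda f: np.sum(res(np.r_[f, np.zeros(order)]) ** 2))

    for _ in range(n_iter):
        r = res(coeffs)
        # Forward-difference Jacobian of the residual w.r.t. the coefficients.
        jac = np.column_stack(
            [(res(coeffs + eps * e) - r) / eps for e in np.eye(order + 1)]
        )
        # Gauss-Newton step: least-squares solution of jac @ step = -r.
        step, *_ = np.linalg.lstsq(jac, -r, rcond=None)
        # Halve the step until the squared error decreases.
        scale = 1.0
        while scale > 1e-4 and np.sum(res(coeffs + scale * step) ** 2) >= r @ r:
            scale *= 0.5
        if scale <= 1e-4:
            break
        coeffs = coeffs + scale * step
        if np.linalg.norm(scale * step) < 1e-6:
            break
    return coeffs
```

Calling `fit_pitch(x, fs)` on a single analysis frame returns coefficients `c` such that the estimated pitch at normalized time `tau` is `c[0] + c[1] * tau` Hz. The grid search keeps the Gauss-Newton iterations from starting far from the solution, which is the role the abstract assigns to its initialization step, though the paper's actual procedure may differ.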