Title
Speech Analysis and Synthesis with a Computationally Efficient Adaptive Harmonic Model
Abstract
Harmonic models have to be both precise and fast in order to represent the speech signal adequately and to process large amounts of data in a reasonable amount of time. For these purposes, the full-band adaptive harmonic model (aHM) used by the adaptive iterative refinement (AIR) algorithm has been proposed to accurately model the perceived characteristics of a speech signal. Although aHM-AIR is precise, it lacks the computational efficiency that would make it convenient for large databases. The least squares (LS) solution used in the original aHM-AIR accounts for most of the computational load. In a previous paper, we suggested a peak picking (PP) approach as a substitute for the LS solution. To integrate the adaptivity scheme of aHM into the PP approach, an adaptive discrete Fourier transform (aDFT), whose frequency basis can fully follow the variations of the fundamental frequency (f0) curve, was also proposed. In this paper, we complete the previous publication by evaluating these methods over the whole analysis process of a speech signal. The evaluations show that PP and aDFT reduce the analysis time by a factor of four on average compared to the LS solution. Additionally, formal listening tests indicate that, when using PP and aDFT, the quality of the re-synthesis is preserved compared to the original LS-based approach.
Year
2015
DOI
10.1109/TASLP.2015.2458580
Venue
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Keywords
Fundamental frequency, harmonic model, peak picking (PP), speech analysis/synthesis, voice model
Field
Least squares, Iterative refinement, Fundamental frequency, Computer science, Harmonic, Speech recognition, Harmonic analysis, Time–frequency analysis, Harmonic model, Discrete Fourier transform
DocType
Journal
Volume
23
Issue
11
ISSN
2329-9290
Citations
0
PageRank
0.34
References
17
Authors
3
Name | Order | Citations | PageRank
Veronica Morfi | 1 | 2 | 2.08
Gilles Degottex | 2 | 203 | 14.84
Athanasios Mouchtaris | 3 | 191 | 28.24