Title
An efficient solution to sparse linear prediction analysis of speech.
Abstract
We propose an efficient solution to the problem of sparse linear prediction analysis of the speech signal. Our method is based on minimization of a weighted l2-norm of the prediction error. The weighting function is constructed such that less emphasis is given to the error around the points where we expect the largest prediction errors to occur (the glottal closure instants) and hence the resulting cost function approaches the ideal l0-norm cost function for sparse residual recovery. We show that the efficient minimization of this objective function (by solving normal equations of linear least squares problem) provides enhanced sparsity level of residuals compared to the l1-norm minimization approach which uses the computationally demanding convex optimization methods. Indeed, the computational complexity of the proposed method is roughly the same as the classic minimum variance linear prediction analysis approach. Moreover, to show a potential application of such sparse representation, we use the resulting linear prediction coefficients inside a multi-pulse synthesizer and show that the corresponding multi-pulse estimate of the excitation source results in slightly better synthesis quality when compared to the classical technique which uses the traditional non-sparse minimum variance synthesizer.
Year
DOI
Venue
2013
10.1186/1687-4722-2013-3
EURASIP J. Audio, Speech and Music Processing
Keywords
Field
DocType
Speech Signal, Sparse Representation, Vocal Tract, Linear Predictive Code, Linear Prediction Coefficient
Minimum-variance unbiased estimator,Residual,Weighting,Computer science,Sparse approximation,Linear prediction,Speech recognition,Linear least squares,Convex optimization,Computational complexity theory
Journal
Volume
Issue
ISSN
2013
1
1687-4722
Citations 
PageRank 
References 
5
0.46
11
Authors
2
Name
Order
Citations
PageRank
Vahid Khanagha1413.97
Khalid Daoudi214523.68