Abstract | ||
---|---|---|
This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain and provides efficient means for studying local non-linear dynamics of complex signals. In this paper we introduce an efficient way for estimation of these parameters and then, we show that they convey relevant information about local dynamics of the speech signal that can be used for the task of phonetic segmentation. We thus develop a two-stage segmentation algorithm: for the first step, we introduce a new dynamic programming technique to efficiently generate an initial list of phoneme-boundary candidates and in the second step, we use hypothesis testing to refine the initial list of candidates. We present extensive experiments on the full TIMIT database. The results show that our algorithm is significantly more accurate than state-of-the-art ones. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1016/j.dsp.2014.08.002 | Digital Signal Processing |
Keywords | Field | DocType |
Nonlinear speech processing,Multiscale signal processing,Phonetic segmentation of speech signals,Piece-wise linear approximation | Dynamic programming,Singularity analysis,Pattern recognition,Segmentation,Computer science,Timit database,Artificial intelligence,Formalism (philosophy),Scaling,Statistical hypothesis testing | Journal |
Volume | Issue | ISSN |
35 | C | 1051-2004 |
Citations | PageRank | References |
12 | 0.62 | 15 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Vahid Khanagha | 1 | 41 | 3.97 |
Khalid Daoudi | 2 | 145 | 23.68 |
Oriol Pont | 3 | 54 | 5.95 |
Hussein M. Yahia | 4 | 87 | 16.06 |