Title
Assessment of disordered voice via the first rahmonic
Abstract
A number of studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum can be usefully employed to indicate hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier transform of the log-magnitude spectrum. In the present study, a number of spectral pre-processing steps are investigated prior to computing the cepstrum; the pre-processing steps include period-synchronous, period-asynchronous, harmonic-synchronous and harmonic-asynchronous spectral band-limitation analysis. The analysis is applied on both sustained vowels [a] and connected speech signals. The correlation between R1 (the amplitude of the first rahmonic) and perceptual ratings is examined for a corpus comprising 251 speakers. It is observed that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a previously reported cepstral cue, cepstral peak prominence (CPP).
Year
DOI
Venue
2012
10.1016/j.specom.2011.04.001
Speech Communication
Keywords
Field
DocType
connected speech,spectral pre-processing step,connected speech signal,sustained vowel,correlation analysis,cepstrum,disordered voice analysis,cepstral cue,first rahmonic,disordered voice,log-magnitude spectrum,cepstral peak prominence,harmonic-asynchronous spectral band-limitation analysis,rahmonic peak,perceptual rating,perceptual ratings increase,pre-processing step
Hoarse voice quality,Connected speech,Mel-frequency cepstrum,Pattern recognition,Computer science,Cepstrum,Speech recognition,Fourier transform,Correlation,Artificial intelligence,Amplitude,Correlation analysis
Journal
Volume
Issue
ISSN
54
5
Speech Communication
Citations 
PageRank 
References 
3
0.50
1
Authors
5
Name
Order
Citations
PageRank
Ali Alpan1153.84
Jean Schoentgen212743.46
Y. Maryn3142.81
Francis Grenez48226.07
P. Murphy530.50