Title
Multi-Band And Multi-Cue Analyses Of Disordered Connected Speech
Abstract
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a speech variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the speaker. Previously, this method has been evaluated on small corpora. In the study that is reported here the corpus has comprised 28 normophonic and 223 dysphonic speakers. This has enabled carrying out the analysis in multiple frequency bands and submitting the signal-to-dysperiodicity ratios per band to multi-variable linear regression analysis with a view to predicting the perceptual ratings of the disordered speech fragments. The analysis results are compared to the cepstral peak prominence, which is a cue that indirectly summarizes vocal dysperiodicities frame-wise via the size of the first rhamonic of the speech cepstrum. Results show that the signal-to-dysperiodicity ratios obtained for low-frequency bands up to 1500 Hz contribute most to the prediction of the perceptual scores. Also, combining the cepstral peak prominence with the low frequency-band signal-to-dysperiodicity ratio increases their common correlation with perceptual scores to 0.8.
Year
Venue
Keywords
2008
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5
analysis of connected disordered speech, variogram analysis, signal-to-dysperiodicity ratio, cepstral peak prominence
Field
DocType
Citations 
Connected speech,Multi band,Pattern recognition,Computer science,Speech recognition,Artificial intelligence
Conference
0
PageRank 
References 
Authors
0.34
3
5
Name
Order
Citations
PageRank
Ali Alpan1153.84
Y. Maryn2142.81
Francis Grenez38226.07
Abdellah Kacha4257.91
Jean Schoentgen512743.46