Title
Automatic Detection Of Pharyngeal Fricatives In Cleft Palate Speech Using Acoustic Features Based On The Vocal Tract Area Spectrum
Abstract
The pharyngeal fricative is a typical compensatory articulation disorder in cleft palate speech. It is produced by retracting the root of the tongue to the posterior pharyngeal wall to substitute for the fricatives and affricates produced in the oral cavity. People who use the pharyngeal fricative have difficulties in daily communication. Research on automatic pharyngeal fricative detection can provide aids in diagnosis for speech-language pathologists and clinical doctors. This work proposes a vocal tract area spectrum (VTAS) to represent a vocal tract model using time-varying cascaded pipes. Four acoustic features based on the VTAS (the centroid and spread (CS), peak linear deviation (PLD), relative-normal entropy (RNE), mean of the ratios? statistics (MRS)) are proposed to evaluate the differences between pharyngeal fricatives and normal speech. The CS feature is proposed to evaluate the overall shape of the vocal tract to detect whether there are abnormal gestures or movements of the articulators in speech production. The PLD and RNE features focus on the variation and complexity of each vocal tube?s area during the whole pronunciation process. The MRS feature is proposed to describe the continuity of the vocal tract. To evaluate the effectiveness of these four features, pharyngeal fricative detection experiments are conducted using a pharyngeal fricative dataset. This dataset contains 1246 speech samples spoken by 50 cleft palate patients and 50 normal speakers, covering all types of initial consonants in which the pharyngeal fricative usually occurs. The detection accuracy of the pharyngeal fricative using the CS, PLD, RNE and MRS feature ranges from 80.66% to 90.21%. When using the proposed CS +PLD+RNE+MRS feature, an accuracy of 95.18% can be achieved on the pharyngeal fricative dataset. ? 2021 Elsevier Ltd. All rights reserved.
Year
DOI
Venue
2021
10.1016/j.csl.2021.101203
COMPUTER SPEECH AND LANGUAGE
Keywords
DocType
Volume
Cleft palate speech, Pharyngeal fricative, Vocal tract area spectrum, Shape and continuity of vocal tract, Complexity of individual vocal tube area sequence
Journal
68
ISSN
Citations 
PageRank 
0885-2308
0
0.34
References 
Authors
10
4
Name
Order
Citations
PageRank
Jia Fu100.68
Fei He23213.85
Heng Yin300.34
Ling He411.70