Abstract | ||
---|---|---|
Hypernasal speech is a common symptom across several neurological disorders; however it has a variable acoustic signature, making it difficult to quantify acoustically or perceptually. In this paper, we propose the nasal cognate distinctiveness features as an objective proxy for hypernasal speech. Our method is motivated by the observation that incomplete velopharyngeal closure changes the acoustics of the resultant speech such that alveolar stops /t/ and /d/ map to the alveolar nasal /n/ and bilabial stops /b/ and /p/ map to bilabial nasal /m/. We propose a new family of features based on likelihood ratios between the plosives and their respective nasal cognates. These features are based on an acoustic model that is trained only on healthy speech, and evaluated on a set of 75 speakers diagnosed with different dysarthria subtypes and exhibiting varying levels of hypernasality. Our results show that the family of features compares favorably with the clinical perception of speech-language pathologists subjectively evaluating hypernasality. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/icassp.2019.8682339 | 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
Keywords | Field | DocType |
speech, hypernasality, dysarthria, velopharyngeal dysfunction, automatic speech recognition | Pattern recognition,Computer science,Artificial intelligence,Audiology,Dysarthria,Hypernasal speech,Nasalization,Acoustic model | Conference |
Volume | ISSN | Citations |
2019 | 1520-6149 | 0 |
PageRank | References | Authors |
0.34 | 0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Michael Saxon | 1 | 0 | 3.72 |
Julie Liss | 2 | 10 | 5.98 |
Visar Berisha | 3 | 76 | 22.38 |