Title
Combining Phone Posteriorgrams From Strong And Weak Recognizers For Automatic Speech Assessment Of People With Aphasia
Abstract
This paper presents an investigation on applying automatic speech recognition (ASR) to speech assessment of people with aphasia (PWA). A distinctive characteristic of PWA speech is paraphasia, which refers to frequent occurrence of phonemic errors, unintended words and non-verbal sounds. In view of the wide variety of paraphasias, we propose to view the ASR errors so caused as out-of-vocabulary (OOV) words. Inspired by previous research on OOV detection, paraphasias in PWA speech are captured by comparing the phone posteriorgrams of a strongly constrained speech recognizer and a weakly constrained one. The posteriorgrams also reveal other characteristics of impaired speech, e.g., change of speaking rate, voice abnormality. Siamese and 2-channel convolutional neural network (CNN) models are used for classifying the posteriorgram pairs and predicting the severity of aphasia. Experimental results on a Cantonese database of PWA speech confirm the effectiveness of the proposed methods. The best F1 score attained on binary classification (severe versus mild aphasia) is 0:891.
Year
DOI
Venue
2019
10.1109/icassp.2019.8683835
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Keywords
Field
DocType
Aphasia, speech assessment, ASR, phone posteriorgrams, CNN
F1 score,Automatic speech,Pattern recognition,Binary classification,Convolutional neural network,Computer science,Impaired speech,Aphasia,Speech recognition,Phone,Artificial intelligence,Paraphasia
Conference
ISSN
Citations 
PageRank 
1520-6149
0
0.34
References 
Authors
0
3
Name
Order
Citations
PageRank
Ying Qin115.43
Tan Lee247674.69
anthony pak hin kong302.03