Title
Emulating the perceptual capabilities of a human evaluator to map the GRB scale for the assessment of voice disorders
Abstract
This paper presents the design of an automatic voice quality analysis system for the assessment of voice pathologies, which emulates the perceptual capabilities of a human evaluator according the GRB scale. For this purpose, a novel methodology based on multiple sets of characteristics, ordinal classification and Gaussian regression is proposed. In particular, a reduced subset of characteristics is identified, and the regressor is used to convert the discrete perceptual scale to a continuum, more in agreement to the nature of the problem under study. The robustness of the system is evaluated in several cross-dataset experiments. Similarly, a clinical evaluation of the predictions provided by the system is carried out. Results indicate that the proposed methodology is proficient in modelling the perceptual capabilities of the human evaluator. They also show that it is possible to extend the GRB scale to a continuum through regression techniques while maintaining the consistency of the results. On average, the deviation between the labels assessed by the expert and the ones provided by the system is of about 0.5 units (in a scale from 0 to 3) for G and B, and of 0.7 units for R. Similarly, the deviation of the labels predicted by the system in the clinical assessment trials is about 0.3 units for G, 0.4 units for B, and 0.5 units for R.
Year
DOI
Venue
2019
10.1016/j.engappai.2019.03.027
Engineering Applications of Artificial Intelligence
Keywords
Field
DocType
Automatic voice quality analysis,GRBAS scale,Voice assessment,Breathiness,Roughness,Hoarseness
Gamma-ray burst,Regression,Ordinal number,Computer science,Robustness (computer science),Gaussian,Artificial intelligence,Perception,Machine learning
Journal
Volume
ISSN
Citations 
82
0952-1976
0
PageRank 
References 
Authors
0.34
0
5