Title
Segmentation of Glottal Images from High-Speed Videoendoscopy Optimized by Synchronous Acoustic Recordings
Abstract
Laryngeal high-speed videoendoscopy (LHSV) is an imaging technique offering novel visualization quality of the vibratory activity of the vocal folds. However, in most image analysis methods, the interaction of the medical personnel and access to ground truth annotations are required to achieve accurate detection of vocal folds edges. In our fully automatic method, we combine video and acoustic data that are synchronously recorded during the laryngeal endoscopy. We show that the image segmentation algorithm of the glottal area can be optimized by matching the Fourier spectra of the pre-processed video and the spectra of the acoustic recording during the phonation of sustained vowel /i:/. We verify our method on a set of LHSV recordings taken from subjects with normophonic voice and patients with voice disorders due to glottal insufficiency. We show that the computed geometric indices of the glottal area make it possible to discriminate between normal and pathologic voices. The median of the Open Quotient and Minimal Relative Glottal Area values for healthy subjects were 0.69 and 0.06, respectively, while for dysphonic subjects were 1 and 0.35, respectively. We also validate these results using independent phoniatrician experts.
Year
DOI
Venue
2022
10.3390/s22051751
SENSORS
Keywords
DocType
Volume
vocal disorders, laryngeal high-speed video, image segmentation, acoustic recordings of voice, signal processing, multimodal sensing
Journal
22
Issue
ISSN
Citations 
5
1424-8220
0
PageRank 
References 
Authors
0.34
0
4
Name
Order
Citations
PageRank
Bartosz Kopczynski101.35
Ewa Niebudek-Bogusz200.34
Wioletta Pietruszewska300.34
Pawel Strumillo49315.54