Title
Combined X-ray and facial videos for phoneme-level articulator dynamics
Abstract
Dynamic external and internal articulator motions are integrated into a low-cost data-driven three-dimensional talking head in this paper. External and internal articulations are defined and calibrated from the video streams and the videofluoroscopy to a generic 3D talking head model. Three different deformation modes in relation to pronunciation characteristics of muscular soft tissue of lips and tongue, up-down movements of chin and the relatively fixed articulators are set up and integrated. The shape blending functions among segmented phonemes of natural speech input are synthesized in an utterance. Animations of the confusable phonemes and minimal pairs are shown to English teachers and learners for a perception test. The results show that the proposed method can reflect the real situation of phonetic pronunciation realistically.
Year
DOI
Venue
2010
10.1007/s00371-010-0434-1
The Visual Computer
Keywords
Field
DocType
muscular soft tissue,facial video,internal articulator motion,phoneme-level articulator dynamic,english teacher,phonetic pronunciation,confusable phoneme,internal articulator dynamics · computer-assisted pronunciation learning · talking head,head model,internal articulation,combined x-ray,minimal pair,different deformation mode,pronunciation characteristic,soft tissue,three dimensional
Pronunciation,Computer vision,Computer science,Utterance,Speech recognition,Chin,Artificial intelligence,Articulator,Perception
Journal
Volume
Issue
ISSN
26
6-8
1432-2315
Citations 
PageRank 
References 
6
0.63
11
Authors
4
Name
Order
Citations
PageRank
Hui Chen1839.69
Lan Wang2267.75
Wenxi Liu319617.18
Pheng-Ann Heng43565280.98