Combined X-ray and facial videos for phoneme-level articulator dynamics - Citegraph

Paper Info

Title
Combined X-ray and facial videos for phoneme-level articulator dynamics

Abstract
Dynamic external and internal articulator motions are integrated into a low-cost data-driven three-dimensional talking head in this paper. External and internal articulations are defined and calibrated from the video streams and the videofluoroscopy to a generic 3D talking head model. Three different deformation modes in relation to pronunciation characteristics of muscular soft tissue of lips and tongue, up-down movements of chin and the relatively fixed articulators are set up and integrated. The shape blending functions among segmented phonemes of natural speech input are synthesized in an utterance. Animations of the confusable phonemes and minimal pairs are shown to English teachers and learners for a perception test. The results show that the proposed method can reflect the real situation of phonetic pronunciation realistically.

Year	DOI	Venue
2010	10.1007/s00371-010-0434-1	The Visual Computer
Keywords	Field	DocType
muscular soft tissue,facial video,internal articulator motion,phoneme-level articulator dynamic,english teacher,phonetic pronunciation,confusable phoneme,internal articulator dynamics · computer-assisted pronunciation learning · talking head,head model,internal articulation,combined x-ray,minimal pair,different deformation mode,pronunciation characteristic,soft tissue,three dimensional	Pronunciation,Computer vision,Computer science,Utterance,Speech recognition,Chin,Artificial intelligence,Articulator,Perception	Journal
Volume	Issue	ISSN
26	6-8	1432-2315
Citations	PageRank	References
6	0.63	11
Authors
4

Authors (4 rows)

Cited by (6 rows)

References (11 rows)

Name	Order	Citations	PageRank
Hui Chen	1	83	9.69
Lan Wang	2	26	7.75
Wenxi Liu	3	196	17.18
Pheng-Ann Heng	4	3565	280.98

1