Optimization of an image-based talking head system - Citegraph

Paper Info

Title
Optimization of an image-based talking head system

Abstract
This paper presents an image-based talking head system, which includes two parts: analysis and synthesis. The audiovisual analysis part creates a face model of a recorded human subject, which is composed of a personalized 3D mask as well as a large database of mouth images and their related information. The synthesis part generates natural looking facial animations from phonetic transcripts of text. A critical issue of the synthesis is the unit selection which selects and concatenates these appropriate mouth images from the database such that they match the spoken words of the talking head. Selection is based on lip synchronization and the similarity of consecutive images. The unit selection is refined in this paper, and Pareto optimization is used to train the unit selection. Experimental results of subjective tests show that most people cannot distinguish our facial animations from real videos.

Year	DOI	Venue
2009	10.1155/2009/174192	EURASIP J. Audio, Speech and Music Processing
Keywords	Field	DocType
facial animation,unit selection,large database,synthesis part,consecutive image,head system,audiovisual analysis part,appropriate mouth image,mouth image,pareto optimization	Computer vision,Synchronization,Computer science,Image based,Multi-objective optimization,Speech recognition,Computer facial animation,Artificial intelligence	Journal
Volume	Issue	ISSN
2009,	1	1687-4722
Citations	PageRank	References
11	0.69	23
Authors
2

Authors (2 rows)

Cited by (11 rows)

References (23 rows)

Name	Order	Citations	PageRank
Kang Liu	1	72	10.93
Joern Ostermann	2	245	17.66

1