Abstract | ||
---|---|---|
In this paper, we present the 3D acquisition infrastructure we developed for building a talking face and studying some aspects of visual speech. A short-term aim is to study coarticulation for the French language and to develop a model which respects a real talker articulation. One key factor is to be able to acquire a large amount of 3D data with a low-cost system more flexible than existing motion capture systems (using infrared cameras and glued markers). Our system only uses two standard cameras, a PC and painted markers that do not change speech articulation and provides a sufficiently fast acquisition rate to enable an efficient temporal tracking of 3D points. We present here our stereovision data capture system and how these data can be used in acoustic-to-articulatory inversion. |
Year | Venue | Keywords |
---|---|---|
2005 | AVSP | data capture,infrared,motion capture,tracking,stereovision,coarticulation |
Field | DocType | Citations |
Computer vision,Motion capture,Computer science,Inversion (meteorology),Manner of articulation,Speech recognition,Coarticulation,Artificial intelligence,Automatic identification and data capture | Conference | 11 |
PageRank | References | Authors |
0.95 | 4 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
B. Wrobel-Dautcourt | 1 | 21 | 2.62 |
M. O. Berger | 2 | 63 | 8.51 |
B. Potard | 3 | 11 | 0.95 |
Y. Laprie | 4 | 13 | 1.42 |
S. Ouni | 5 | 11 | 0.95 |