Title | ||
---|---|---|
A multi-channel/multi-speaker articulatory database in Mandarin for speech visualization |
Abstract | ||
---|---|---|
The application of articulatory database in speech production and automatic speech recognition has been practiced for many years. The goal of the research was to build an articulatory database specifying in Chinese Mandarin production and to investigate its efficacy in speech animation. Carstens EMA AG501 device were respectively used to capture acoustic data and articulatory data. Also, a Microsoft Kinect camera was applied to capture face-tracking data as a supplement. Finally, we tried several methods to extract acoustic parameters and built up a 3D talking head model to verify the efficacy of the database. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/ISCSLP.2014.6936629 | ISCSLP |
Keywords | Field | DocType |
acoustic data,articulatory data,carstens ema ag501 device,face recognition,speech recognition,ema,speech animation,computer animation,microsoft kinect camera,image sensors,multichannel-multispeaker articulatory database,chinese mandarin production,object tracking,articulatory database,mandarin,face-tracking data,kinect camera,3d talking head model,speech visualization,automatic speech recognition,speech production | Computer science,Visualization,Speech recognition,Multi channel,Animation,Speech production,Database,Mandarin Chinese | Conference |
Citations | PageRank | References |
4 | 0.40 | 8 |
Authors | ||
6 |