Abstract | ||
---|---|---|
This paper aims at comparing the discrimination between audio, 2D-based visual and 3D-based visual features for the speech recognition purpose. The audio and visual feature extraction schemes and several feature selection techniques are described first in this paper. With the application of the described feature extraction and selection methods, several experiments are conducted to compare the discrimination of the audio features, the 2D visual features and the 3D visual features for the hVd words classification task. In our study, it is found that the 3D visual features have more separability than the 2D visual features, so that the 3D-based audio-visual speech recognition may achieve more desirable results than the traditional 2D-based counterpart. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/ACSSC.2012.6489302 | ACSCC |
Keywords | DocType | ISSN |
2d-based visual feature extraction scheme,speech recognition,3d-based visual feature extraction scheme,audio feature extraction scheme,feature extraction,hvd words classification task,feature selection technique,3d-based audio-visual speech recognition | Conference | 1058-6393 |
ISBN | Citations | PageRank |
978-1-4673-5050-1 | 4 | 0.48 |
References | Authors | |
7 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chao Sui | 1 | 20 | 1.81 |
Roberto Togneri | 2 | 814 | 48.33 |
Serajul Haque | 3 | 18 | 2.42 |
M. Bennamoun | 4 | 3197 | 167.23 |