Title | ||
---|---|---|
Visual Speech Recognition Method Using Translation, Scale and Rotation Invariant Features |
Abstract | ||
---|---|---|
This paper reports on a visual speech recognition method that is invariant to translation, rotation and scale. Dynamic features representing the mouth motion is extracted from the video data by using a motion segmentation technique termed as motion history image (MHI). MHI is generated by applying accumulative image differencing technique on the sequence of mouth images. Invariant features are derived from the MHI using feature extraction algorithm that combines Discrete Stationary Wavelet Transform (SWT) and moments. A 2-D SWT at level one is applied to decompose MHI to produce one approximate and three detail sub images. The feature descriptors consist of three moments (geometric moments, Hu moments and Zernike moments) computed from the SWT approximate image. The moments features are normalized to achieve the invariance properties. Artificial neural network (ANN) with back propagation learning algorithm is used to classify the moments features. Initial experiments were conducted to test the sensitivity of the proposed approach to rotation, translation and scale of the mouth images and obtained promising results. |
Year | DOI | Venue |
---|---|---|
2006 | 10.1109/AVSS.2006.118 | AVSS |
Keywords | Field | DocType |
2-d swt,dynamic feature,swt approximate image,detail sub image,accumulative image,motion history image,mouth image,visual speech recognition method,moments feature,rotation invariant features,mouth motion,motion segmentation technique,testing,history,computer vision,artificial neural network,feature extraction,artificial neural networks,stationary wavelet transform,back propagation,speech recognition,data mining | Computer science,Zernike polynomials,Artificial intelligence,Artificial neural network,Velocity Moments,Computer vision,Pattern recognition,Invariant (physics),Image differencing,Feature extraction,Speech recognition,Invariant (mathematics),Stationary wavelet transform | Conference |
ISBN | Citations | PageRank |
0-7695-2688-8 | 2 | 0.41 |
References | Authors | |
7 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Wai Chee Yau | 1 | 40 | 4.87 |
Dinesh Kant Kumar | 2 | 168 | 28.34 |
Sridhar Poosapadi Arjunan | 3 | 27 | 5.79 |