Title
Deep Sign: Enabling Robust Statistical Continuous Sign Language Recognition via Hybrid CNN-HMMs.
Abstract
This manuscript introduces the end-to-end embedding of a CNN into a HMM, while interpreting the outputs of the CNN in a Bayesian framework. The hybrid CNN-HMM combines the strong discriminative abilities of CNNs with the sequence modelling capabilities of HMMs. Most current approaches in the field of gesture and sign language recognition disregard the necessity of dealing with sequence data both for training and evaluation. With our presented end-to-end embedding we are able to improve over the state-of-the-art on three challenging benchmark continuous sign language recognition tasks by between 15 and 38% relative reduction in word error rate and up to 20% absolute. We analyse the effect of the CNN structure, network pretraining and number of hidden states. We compare the hybrid modelling to a tandem approach and evaluate the gain of model combination.
Year
DOI
Venue
2018
10.1007/s11263-018-1121-3
International Journal of Computer Vision
Keywords
Field
DocType
Sign language recognition,Hybrid approach,CNN-HMM,Statistical approach,Sequence modelling
Embedding,Computer science,Gesture,Word error rate,Speech recognition,Sign language,Data sequences,Artificial intelligence,Hidden Markov model,Discriminative model,Machine learning,Bayesian probability
Journal
Volume
Issue
ISSN
126
12
0920-5691
Citations 
PageRank 
References 
5
0.46
28
Authors
4
Name
Order
Citations
PageRank
Oscar Koller11289.02
Sepehr Zargaran250.46
Hermann Ney3141781506.93
Richard Bowden41840118.50