Title
Some observations on computer lip-reading: moving from the dream to the reality
Abstract
In the quest for greater computer lip-reading performance there are a number of tacit assumptions which are either present in the datasets (high resolution for example) or in the methods (recognition of spoken visual units called "visemes" for example). Here we review these and other assumptions and show the surprising result that computer lip-reading is not heavily constrained by video resolution, pose, lighting and other practical factors. However, the working assumption that visemes, which are the visual equivalent of phonemes, are the best unit for recognition does need further examination. We conclude that visemes, which were defined over a century ago, are unlikely to be optimal for a modern computer lip-reading system.
Year: 2017
DOI: 10.1117/12.2067464
Venue: Proceedings of SPIE
Keywords: Lip-reading, speech recognition, pattern recognition
Field: Computer graphics (images), Display resolution, Computer science, Viseme, Upload, Dream, Computing systems
DocType: Journal
Volume: 9253
ISSN: 0277-786X
Citations: 5
PageRank: 0.45
References: 5
Authors: 4
Name                  Order  Citations  PageRank
Helen L. Bear         1      30         7.10
Gari P. Owen          2      5          0.45
Richard Harvey        3      442        33.95
Barry-John Theobald   4      332        25.39