Title
Introduction Of Statistical Information In A Syntactic Analyzer For Document Image Recognition
Abstract
This paper presents an improvement to document layout analysis systems that offers a possible solution to Sayre's paradox, which states that an element "must be recognized before it can be segmented; and it must be segmented before it can be recognized". The improvement, based on stochastic parsing, integrates statistical information obtained from recognizers into syntactic layout analysis. We show how this fusion of numeric and symbolic information in a feedback loop can be applied to syntactic methods to improve the expressiveness of document descriptions. To limit combinatorial explosion during the exploration of solutions, we devised an operator that allows optional activation of the stochastic parsing mechanism. Our evaluation on 1250 handwritten business letters shows that this method improves global recognition scores.
Year: 2011
DOI: 10.1117/12.873393
Venue: DOCUMENT RECOGNITION AND RETRIEVAL XVIII
Keywords: layout analysis, structure recognition, stochastic parsing, content-based analysis, handwritten letters
Field: Computer vision, Combinatorial method, Computer science, Document layout analysis, Feedback loop, Artificial intelligence, Operator (computer programming), Natural language processing, Parsing, Syntactic methods, Syntax, Combinatorial explosion
DocType: Conference
Volume: 7874
ISSN: 0277-786X
Citations: 2
PageRank: 0.44
References: 5
Authors: 3
Name | Order | Citations | PageRank
André O. Maroneze | 1 | 2 | 0.44
Bertrand Coüasnon | 2 | 169 | 19.22
Aurélie Lemaitre | 3 | 63 | 9.41