Title
ODIL: an SGML description language of the layout structure of documents
Abstract
This paper describes an SGML coding format for the output of a document recognition prototype. Our proposal is a DTD named ODIL (Office Document Image description Language) that precisely describes the layout structure of a document after all recognition phases, including OCR. All layout objects of a document are defined as SGML elements, and their characteristics are defined by SGML attributes. The basic objects are blocks, each containing homogeneous information. The ODIL language supports five types of information: text, photos, line graphics, tables, and mathematical formulas. The ODIL representation of the recognition results is well suited to subsequent logical structure recognition. Starting from the ODIL DTD and using the RAINBOW transit DTD makes it possible to use SGML tools for logical structure recognition, which is viewed as an SGML up-conversion problem.
Year
1995
DOI
10.1109/ICDAR.1995.599040
Venue
ICDAR-1
Keywords
odil representation, odil language, odil dtd, sgml element, document recognition prototype, sgml tool, sgml up-conversion problem, recognition phase, layout structure, sgml description language, logical structure recognition, recognition result, pixel, image recognition, segmentation, prototypes, mathematics, odl, sgml, layout, graphics, image segmentation
Field
Graphics, Document type declaration, SGML, Information retrieval, Computer science, Coding (social sciences), Structure (mathematical logic), SGML entity, Processing Instruction, Document type definition
DocType
Conference
ISBN
0-8186-7128-9
Citations
2
PageRank
0.54
References
0
Authors
2
Name         Order  Citations  PageRank
P. Lefevre   1      2          0.54
F. Reynaud   2      2          0.54