Title
Data structures and tools for document database generation: an experimental system
Abstract
This paper is a contribution to the discussion of the structure and the elements of databases for document analysis tasks and the tools needed for database creation. It is pointed out that it is desirable to have a uniform document database that allows access to different kinds of data for different sub tasks within a complete document analysis system. Conceptual ideas pertaining to the data structure are discussed on the assumption of a hierarchical document structure. A description of an implemented data structure: is also included that may serve as a starting point for further investigation and discussion. Finally, we present INSEGD, an experimental system for interactive segmentation and labelling of arbitrary documents, which is still under development along with a tool box for automatically and semi-automatically generating segmentations for support in data generation.
Year
DOI
Venue
1995
10.1109/ICDAR.1995.602002
ICDAR-1
Keywords
Field
DocType
text analysis,hidden markov models,document structure,data structure,image segmentation,benchmark testing,labeling,artificial neural networks,data acquisition,independent component analysis,databases,data analysis,data structures
Data mining,Data structure,Experimental system,Information retrieval,Computer science,Segmentation,Data acquisition,Document Structure Description,Image segmentation,Database,Test data generation,Benchmark (computing)
Conference
Volume
ISBN
Citations 
2
0-8186-7128-9
3
PageRank 
References 
Authors
0.60
0
2
Name
Order
Citations
PageRank
Rolf-Dieter Bippus172.19
Volker Märgner229529.02