Title
A robust hybrid approach for text line segmentation in historical documents
Abstract
Large-scale digitisation of historical documents demands robust methods that cope with the presence of frequent distortions and noisy artefacts. This paper presents a hybrid text line segmentation method that uses a novel data structure and a rule base to combine the strengths of top-down and bottom-up approaches while minimising their weaknesses. The effectiveness of the proposed approach has been methodically evaluated in the context of large-scale digitisation using a standardised framework. Results on a diverse dataset show improved performance over top-down and bottom-up approaches as well as over a leading commercially available system.
Year
Venue
Keywords
2012
ICPR
hybrid text line segmentation method,bottom-up approach,knowledge based systems,historical document,distortion,image segmentation,data structures,noisy artefacts,data structure,top-down approach,rule base,digitisation,text analysis
Field
DocType
ISSN
Data mining,Computer vision,Data structure,Scale-space segmentation,Pattern recognition,Computer science,Segmentation,Knowledge-based systems,Segmentation-based object categorization,Image segmentation,Artificial intelligence,Distortion
Conference
1051-4651
ISBN
Citations 
PageRank 
978-1-4673-2216-4
2
0.36
References 
Authors
6
3
Name
Order
Citations
PageRank
Christian Clausner1448.49
Apostolos Antonacopoulos237836.45
stefan pletschacher321620.78