Title
Overlapping and multi-touching text-line segmentation by Block Covering analysis
Abstract
This paper presents a new approach for text-line segmentation based on Block Covering which solves the problem of overlapping and multi-touching components. Block Covering is the core of a system which processes a set of ancient Arabic documents from historical archives. The system is designed for separating text-lines even if they are overlapping and multi-touching. We exploit the Block Covering technique in three steps: a new fractal analysis (Block Counting) for document classification, a statistical analysis of block heights for block classification and a neighboring analysis for building text-lines. The Block Counting fractal analysis, associated with a fuzzy C-means scheme, is performed on document images in order to classify them according to their complexity: tightly (closely) spaced documents (TSD) or widely spaced documents (WSD). An optimal Block Covering is applied on TSD documents which include overlapping and multi-touching lines. The large blocks generated by the covering are then segmented by relying on the statistical analysis of block heights. The final labeling into text-lines is based on a block neighboring analysis. Experimental results provided on images of the Tunisian Historical Archives reveal the feasibility of the Block Covering technique for segmenting ancient Arabic documents.
Year
DOI
Venue
2009
10.1007/s10044-008-0127-9
Pattern Anal. Appl.
Keywords
Field
DocType
block covering technique,optimal block covering,block height,statistical analysis,spaced document,neighboring analysis,block covering,new fractal analysis,block covering analysis,fractal analysis,text-line segmentation,block coveringtext-line segmentation � overlapping and multi-touching linesblock counting � ancient arabic documents,ancient arabic document,counting,arabic,classification,fractal,image segmentation,coverage,text analysis,document processing,feasibility,fuzzy logic,image analysis
Fractal analysis,Document classification,Arabic,Pattern recognition,Segmentation,Fuzzy logic,Document processing,Fractal,Image segmentation,Artificial intelligence,Mathematics
Journal
Volume
Issue
ISSN
12
4
1433-755X
Citations 
PageRank 
References 
5
0.52
14
Authors
4
Name
Order
Citations
PageRank
Abderrazak Zahour128214.83
Brunco Taconet250.52
Laurence Likforman-Sulem356043.90
Wafa Boussellaa4101.36