Abstract | ||
---|---|---|
This paper presents a new approach for text-line segmentation based on Block Covering which solves the problem of overlapping and multi-touching components. Block Covering is the core of a system which processes a set of ancient Arabic documents from historical archives. The system is designed for separating text-lines even if they are overlapping and multi-touching. We exploit the Block Covering technique in three steps: a new fractal analysis (Block Counting) for document classification, a statistical analysis of block heights for block classification and a neighboring analysis for building text-lines. The Block Counting fractal analysis, associated with a fuzzy C-means scheme, is performed on document images in order to classify them according to their complexity: tightly (closely) spaced documents (TSD) or widely spaced documents (WSD). An optimal Block Covering is applied on TSD documents which include overlapping and multi-touching lines. The large blocks generated by the covering are then segmented by relying on the statistical analysis of block heights. The final labeling into text-lines is based on a block neighboring analysis. Experimental results provided on images of the Tunisian Historical Archives reveal the feasibility of the Block Covering technique for segmenting ancient Arabic documents. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1007/s10044-008-0127-9 | Pattern Anal. Appl. |
Keywords | Field | DocType |
block covering technique,optimal block covering,block height,statistical analysis,spaced document,neighboring analysis,block covering,new fractal analysis,block covering analysis,fractal analysis,text-line segmentation,block coveringtext-line segmentation � overlapping and multi-touching linesblock counting � ancient arabic documents,ancient arabic document,counting,arabic,classification,fractal,image segmentation,coverage,text analysis,document processing,feasibility,fuzzy logic,image analysis | Fractal analysis,Document classification,Arabic,Pattern recognition,Segmentation,Fuzzy logic,Document processing,Fractal,Image segmentation,Artificial intelligence,Mathematics | Journal |
Volume | Issue | ISSN |
12 | 4 | 1433-755X |
Citations | PageRank | References |
5 | 0.52 | 14 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Abderrazak Zahour | 1 | 282 | 14.83 |
Brunco Taconet | 2 | 5 | 0.52 |
Laurence Likforman-Sulem | 3 | 560 | 43.90 |
Wafa Boussellaa | 4 | 10 | 1.36 |