Abstract | ||
---|---|---|
In this paper, we propose a technique for removing margin noise (both textual and non-textual noise) from scanned document images. We perform layout analysis to detect words, lines, and paragraphs in the document image. These detected elements are classified into text and non-text components on the basis of their characteristics (size, position, etc.). The geometric properties of the text blocks are sought to detect and remove the margin noise. We evaluate our algorithm on several scanned pages of Bengali literature books. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1145/2432553.2432570 | DAR@ICVGIP |
Keywords | Field | DocType |
margin noise,margin noise removal,printed document image,text block,scanned page,layout analysis,bengali literature book,non-text component,document image,non-textual noise,geometric property,scanned document image,segmentation,connected components | Pattern recognition,Segmentation,Document layout analysis,Bengali,Connected component,Artificial intelligence,Engineering,Noise removal | Conference |
Citations | PageRank | References |
6 | 0.50 | 8 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Soumyadeep Dey | 1 | 12 | 3.00 |
Jayanta Mukhopadhyay | 2 | 72 | 26.05 |
Shamik Sural | 3 | 1008 | 96.36 |
Partha Bhowmick | 4 | 6 | 0.84 |