Title
Margin noise removal from printed document images
Abstract
In this paper, we propose a technique for removing margin noise (both textual and non-textual noise) from scanned document images. We perform layout analysis to detect words, lines, and paragraphs in the document image. These detected elements are classified into text and non-text components on the basis of their characteristics (size, position, etc.). The geometric properties of the text blocks are sought to detect and remove the margin noise. We evaluate our algorithm on several scanned pages of Bengali literature books.
Year
DOI
Venue
2012
10.1145/2432553.2432570
DAR@ICVGIP
Keywords
Field
DocType
margin noise,margin noise removal,printed document image,text block,scanned page,layout analysis,bengali literature book,non-text component,document image,non-textual noise,geometric property,scanned document image,segmentation,connected components
Pattern recognition,Segmentation,Document layout analysis,Bengali,Connected component,Artificial intelligence,Engineering,Noise removal
Conference
Citations 
PageRank 
References 
6
0.50
8
Authors
4
Name
Order
Citations
PageRank
Soumyadeep Dey1123.00
Jayanta Mukhopadhyay27226.05
Shamik Sural3100896.36
Partha Bhowmick460.84