Title
A novel form detection and removal scheme for document images
Abstract
In this work, we propose a novel algorithm to extract form structure from form documents. First, an effective image binarization method is proposed as a preprocessor for the purpose of color quantization, which is able to deal with complex images by adaptively taking the global and local image characteristics into account. Then, a form feature extraction unit modeled by Markov random field (MRF) is followed to extract the form feature. It estimates the probability of each pixel from its neighborhood state, and recognizes the form structure based on the probability result. Unlike the geometric-based method in literature, our probability-based form feature extraction algorithm is more effective and can successfully recognize any size of form cells. In the experiment, our method has been integrated into a digital archiving system to evaluate its effectiveness in terms of both the visual and f-measurement numerical performance.
Year
DOI
Venue
2010
10.1109/ICIP.2010.5651971
ICIP
Keywords
Field
DocType
gibbs random field,probability estimation,document images,probability-based form feature extraction algorithm,quantisation (signal),digital archiving,form feature extraction unit,form removal,form detection,image recognition,feature extraction,form extraction,object detection,image binarization,digital archiving system,f-measurement numerical performance,markov random field,markov processes,effective image binarization method,information retrieval systems,document image processing,color quantization,image colour analysis,probability,pattern recognition,algorithm design and analysis,pixel,classification algorithms,random field
Computer vision,Object detection,Markov process,Algorithm design,Pattern recognition,Computer science,Markov random field,Feature extraction,Preprocessor,Artificial intelligence,Pixel,Color quantization
Conference
ISSN
ISBN
Citations 
1522-4880 E-ISBN : 978-1-4244-7993-1
978-1-4244-7993-1
0
PageRank 
References 
Authors
0.34
3
2
Name
Order
Citations
PageRank
Tien-Ying Kuo114819.24
Yi-Chung Lo2325.06