Title
An Iterative Refinement Framework for Image Document Binarization with Bhattacharyya Similarity Measure
Abstract
Background noise and illumination condition are two primary factors degrading the performance of document image binarization. In this paper, we propose an iterative refinement framework to support robust binarization. Initially, an input image is transformed into a Bhattacharyya similarity matrix with Gaussian kernel, which is subsequently converted into a binary image using maximum entropy classifier. Then, we adopt the run-length histogram to estimate the character stroke width, an important indicator to determine the length of filter window. After noise elimination, the output image is used for the next round of refinement and the process terminates when the estimated stroke width is stable. Extensive experiments were conducted on the standard DIBCO datasets as well as a new benchmark harvested from our user query log. Results show that our proposed method outperforms state-of-the-art methods and is more robust to handle low-quality images.
Year
DOI
Venue
2017
10.1109/ICDAR.2017.24
2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)
Keywords
Field
DocType
Image Binarization,Bhattacharyya Similarity
Iterative refinement,Histogram,Bhattacharyya distance,Background noise,Similarity measure,Pattern recognition,Computer science,Binary image,Feature extraction,Artificial intelligence,Gaussian function
Conference
Volume
ISSN
ISBN
01
1520-5363
978-1-5386-3587-2
Citations 
PageRank 
References 
1
0.36
3
Authors
9
Name
Order
Citations
PageRank
Ning Liu18831.20
Dongxiang Zhang274343.89
Xing Xu376462.73
Wenju Liu421439.32
Dengfeng Ke5126.51
Long Guo6654.17
Shengkun Shi710.36
Hui Liu811629.48
Lijiang Chen930423.22