Title
Document Preprocessing System - Automatic Selection of Binarization
Abstract
Due to the reason that historical documents present many degradations, the analysis of such documents is considered as a big challenge. In this paper we present a system which allows automatic preprocessing of historical documents. One or many preprocessing methods, as well as sets of input parameters are selected for each book from the used database according to the input image features. Such selection is tested on a subset of every book during the training step, the validation of the carried results is performed on another subset of images. If the validation is not well checked, the training is repeated. The proposed system is applied on a set of books from the Google-Books (23 books, 1000 images) and the Bayerische Staatsbibliothek (10 books, 750 images) collections. The performed results are very promising.
Year
DOI
Venue
2012
10.1109/DAS.2012.31
Document Analysis Systems
Keywords
DocType
ISBN
bayerische staatsbibliothek,used database,automatic selection,proposed system,preprocessing method,big challenge,input image feature,automatic preprocessing,historical document,document preprocessing system,training step,input parameter,frequency modulation,text analysis,databases,history,image features,measurement,psnr,feature extraction
Conference
978-1-4673-0868-7
Citations 
PageRank 
References 
4
0.51
16
Authors
4
Name
Order
Citations
PageRank
Ines Ben Messaoud1596.58
Hamid Amiri28619.36
Haikal El-Abed343629.39
Volker Margner41076.37