Title
Evaluation of different feature sets in an OCR free method for word spotting in printed documents
Abstract
This paper evaluates three feature sets in an OCR-free word spotting method under a strong experimental protocol, so that all feature sets are compared under the same experimental conditions. In addition, a tuning process for the document segmentation step is proposed, which significantly reduces processing time. For this purpose, a complete OCR-free method for word spotting in printed documents was implemented, and a database of document images with their corresponding ground-truth text files was created. The protocol, based on 800 document images, allows us to compare the results of the three feature sets used to represent the word image.
Year: 2010
DOI: 10.1145/1774088.1774099
Venue: SAC
Keywords: document segmentation step, different feature set, strong experimental protocol, ocr free method, printed document, word image, ocr free word, three feature set, experimental condition, document database, document image, indexation, ground truth, document retrieval
Field: Pattern recognition, Document clustering, Computer science, Document segmentation, Keyword spotting, Ground truth, Artificial intelligence, Spotting
DocType: Conference
Citations: 0
PageRank: 0.34
References: 7
Authors: 4
Name                   Order  Citations  PageRank
Israel Rios            1      0          0.68
Alceu Britto           2      94         18.30
Alessandro L. Koerich  3      525        39.59
Luiz S. Oliveira       4      476        47.22