Title | ||
---|---|---|
Evaluation of different feature sets in an OCR free method for word spotting in printed documents |
Abstract | ||
---|---|---|
This paper presents the evaluation of tree feature sets in an OCR free word spotting method under a strong experimental protocol. Different feature sets are evaluated under the same experimental conditions. In addition, a tuning process in the document segmentation step is proposed which provides a significant reduction in terms of the processing time. For this purpose, a complete OCR-free method for word spotting in printed documents was implemented, and a document database containing document images and their corresponding ground truth text files was created. A strong experimental protocol based on 800 document images allows us to compare the results of the three feature sets used to represent the word image. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1145/1774088.1774099 | SAC |
Keywords | Field | DocType |
document segmentation step,different feature set,strong experimental protocol,ocr free method,printed document,word image,ocr free word,tree feature set,experimental condition,document database,document image,indexation,ground truth,document retrieval | Pattern recognition,Document clustering,Computer science,Document segmentation,Keyword spotting,Ground truth,Artificial intelligence,Spotting | Conference |
Citations | PageRank | References |
0 | 0.34 | 7 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Israel Rios | 1 | 0 | 0.68 |
Alceu Britto | 2 | 94 | 18.30 |
Alessandro L. Koerich | 3 | 525 | 39.59 |
Luiz S. Oliveira | 4 | 476 | 47.22 |