Title |
---|
An Image Database Of Handwritten Bangla Words With Automatic Benchmarking Facilities For Character Segmentation Algorithms |
Abstract |
---|
Recognition of unconstrained handwritten word images is an interesting research problem that becomes more challenging when lexicon-free words are considered. A prerequisite for developing a lexicon-free handwritten word recognition technique is the segmentation of a word image into its constituent characters. A competent character segmentation technique is therefore required to design a comprehensive word recognition module. However, a study of the literature reveals that there is no standard word image database with ground truth information. As a result, most character segmentation algorithms found in the literature rely on self-made databases with manual evaluation. To fill this research need, the present work prepares a comprehensive database of handwritten Bangla word images, primarily for evaluating character segmentation algorithms. The work also provides two types of ground truth images related to the segmented character shapes of the word images. In addition, an evaluation tool is developed for assessing the performance of any character segmentation algorithm on the developed benchmark database. The benchmark result reported here is an F-score of 0.9212, which outperforms some state-of-the-art methods. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1007/s00521-020-04981-w | NEURAL COMPUTING & APPLICATIONS |
Keywords | DocType | Volume |
---|---|---|
Character segmentation, Handwritten word, Bangla script, Image database, Word recognition | Journal | 33 |
Issue | ISSN | Citations |
---|---|---|
1 | 0941-0643 | 0 |
PageRank | References | Authors |
---|---|---|
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Samir Malakar | 1 | 22 | 7.90 |
Ram Sarkar | 2 | 420 | 68.85 |
Subhadip Basu | 3 | 385 | 43.75 |
Mahantapas Kundu | 4 | 420 | 39.26 |
Mita Nasipuri | 5 | 725 | 107.01 |