Title
An Image Database Of Handwritten Bangla Words With Automatic Benchmarking Facilities For Character Segmentation Algorithms
Abstract
Recognition of unconstrained handwritten word images is an interesting research problem which gets more challenging when lexicon-free words are considered. Prerequisite for developing a lexicon-free handwritten word recognition technique is the segmentation of a word image into its constituent character set. Therefore, a competent character segmentation technique is required to design a comprehensive word recognition module. However, the literature study reveals that there is no standard word image database with ground truth information. As a result, most character segmentation algorithms found in the literature rely on self-made databases with manual evaluation. To fill the research need, in the present scope of the work, a comprehensive database consisting of handwritten Bangla word images is prepared primarily for evaluating any character segmentation algorithms. Additionally, the present work also provides two types of ground truth images related to segmented character shapes of the word images. Besides, an evaluation tool is developed for assessing the performance of any character segmentation algorithm on the developed benchmark database. The benchmark result, as found here, is 0.9212 (F-score) which outperforms some state-of-the-art methods.
Year
DOI
Venue
2021
10.1007/s00521-020-04981-w
NEURAL COMPUTING & APPLICATIONS
Keywords
DocType
Volume
Character segmentation, Handwritten word, Bangla script, Image database, Word recognition
Journal
33
Issue
ISSN
Citations 
1
0941-0643
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Samir Malakar1227.90
Ram Sarkar242068.85
Subhadip Basu338543.75
Mahantapas Kundu442039.26
Mita Nasipuri5725107.01