Title
Lexicon reduction using dots for off-line Farsi/Arabic handwritten word recognition
Abstract
Unlike many other languages, 18 out of 32 Farsi characters have dots appearing in groups of one, two or three. Some of these letters share common primary shapes, differing only in the number of dots and whether the dots are above or below the primary shape. In this paper, a new concept of using dots in a cursively handwritten Farsi/Arabic word is introduced for lexicon reduction and a fast method for extracting dots is presented. The technique involves extraction and representation of number and position of dots from off-line handwritten words to eliminate unlikely candidates. Experimental results on a set of 12,000 handwritten word images yield a lexicon reduction of 93% with accuracy of 85%. The proposed lexicon reduction algorithm achieves the speedup factor of 2 as well as 13% improvement in recognition rate.
Year
DOI
Venue
2008
10.1016/j.patrec.2007.11.009
Pattern Recognition Letters
Keywords
Field
DocType
dot extraction,cursively handwritten farsi,off-line handwritten word,arabic handwritten word recognition,string matching,arabic word,common primary shape,lexicon reduction,discrete hidden markov model,primary shape,farsi character,proposed lexicon reduction algorithm,off-line farsi,handwritten word image,off-line farsi/arabic handwritten word recognition,hidden markov model
String searching algorithm,Speech processing,Arabic,Computer science,Lexico,Artificial intelligence,Natural language processing,Speedup,Pattern recognition,Word recognition,Speech recognition,Lexicon,Hidden Markov model
Journal
Volume
Issue
ISSN
29
6
Pattern Recognition Letters
Citations 
PageRank 
References 
12
0.64
20
Authors
4
Name
Order
Citations
PageRank
Saeed Mozaffari115214.19
Karim Faez281983.23
Volker Märgner329529.02
Haikal El-Abed443629.39