Abstract | ||
---|---|---|
Unlike many other languages, 18 out of 32 Farsi characters have dots appearing in groups of one, two or three. Some of these letters share common primary shapes, differing only in the number of dots and whether the dots are above or below the primary shape. In this paper, a new concept of using dots in a cursively handwritten Farsi/Arabic word is introduced for lexicon reduction and a fast method for extracting dots is presented. The technique involves extraction and representation of number and position of dots from off-line handwritten words to eliminate unlikely candidates. Experimental results on a set of 12,000 handwritten word images yield a lexicon reduction of 93% with accuracy of 85%. The proposed lexicon reduction algorithm achieves the speedup factor of 2 as well as 13% improvement in recognition rate. |
Year | DOI | Venue |
---|---|---|
2008 | 10.1016/j.patrec.2007.11.009 | Pattern Recognition Letters |
Keywords | Field | DocType |
dot extraction,cursively handwritten farsi,off-line handwritten word,arabic handwritten word recognition,string matching,arabic word,common primary shape,lexicon reduction,discrete hidden markov model,primary shape,farsi character,proposed lexicon reduction algorithm,off-line farsi,handwritten word image,off-line farsi/arabic handwritten word recognition,hidden markov model | String searching algorithm,Speech processing,Arabic,Computer science,Lexico,Artificial intelligence,Natural language processing,Speedup,Pattern recognition,Word recognition,Speech recognition,Lexicon,Hidden Markov model | Journal |
Volume | Issue | ISSN |
29 | 6 | Pattern Recognition Letters |
Citations | PageRank | References |
12 | 0.64 | 20 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Saeed Mozaffari | 1 | 152 | 14.19 |
Karim Faez | 2 | 819 | 83.23 |
Volker Märgner | 3 | 295 | 29.02 |
Haikal El-Abed | 4 | 436 | 29.39 |