Title
The optical character recognition of Urdu-like cursive scripts
Abstract
We survey the optical character recognition (OCR) literature with reference to the Urdu-like cursive scripts. In particular, the Urdu, Pushto, and Sindhi languages are discussed, with an emphasis on the Nasta'liq and Naskh scripts. Before detailing the OCR works, we outline the peculiarities of the Urdu-like scripts and then present the available text image databases. For the sake of clarity, the various attempts are grouped into three parts, namely: (a) printed, (b) handwritten, and (c) online character recognition. Within each part, the works are analyzed with respect to a typical OCR pipeline, with an emphasis on preprocessing, segmentation, feature extraction, classification, and recognition.
Year
2014
DOI
10.1016/j.patcog.2013.09.037
Venue
Pattern Recognition
Keywords
feature extraction,online character recognition,sindhi language,par rapport,available text image databases,optical character recognition,urdu-like script,naskh script,urdu-like cursive script,typical ocr pipeline,character
Field
Cursive,Computer science,Optical character recognition,Feature extraction,Speech recognition,Urdu,Preprocessor,Artificial intelligence,Sindhi,Natural language processing,Intelligent word recognition,Scripting language
DocType
Journal
Volume
47
Issue
3
ISSN
0031-3203
Citations
30
PageRank
1.13
References
37
Authors
6
Name, Order, Citations, PageRank
Saeeda Naz, 1, 147, 14.25
Khizar Hayat, 2, 248, 19.71
Muhammad Imran Razzak, 3, 221, 33.86
Muhammad Waqas Anwar, 4, 34, 2.92
Sajjad A. Madani, 5, 682, 23.38
Samee U. Khan, 6, 1572, 83.04