Title
Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm
Abstract
Deep learning has increased the performance of classification and object detection, but it generally requires large amounts of labeled data for training. In this paper, we introduce a new data augmentation algorithm that promotes diversity between classes, representing the characters of the Arabic script, and can balance samples between different classes. This algorithm gives each word in the lexicon a weight. The weight of a word is based on the occurrence probabilities of the characters constituting the word. Minority classes are given higher weight as compared to the classes frequently occurring in the text. The data augmentation technique was evaluated on a handwritten word recognition task using the publicly available IFN/ENIT and AHDB datasets. We see significant improvement in results by employing our data augmentation technique, and we achieve state-of-the-art results on both datasets.
Year
DOI
Venue
2021
10.1007/978-3-030-86198-8_23
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I
Keywords
DocType
Volume
Handwriting recognition, Deep Learning Neural Network, Data augmentation, Recurrent Neural Network, Connectionist temporal classification
Conference
12916
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
4
Name
Order
Citations
PageRank
Mohamed Eltay100.34
Abdelmalek B. C. Zidouri23010.37
Irfan Ahmad312310.13
Yousef Elarian400.68