Title
Can One Deep Learning Model Learn Script-Independent Multilingual Word-Spotting?
Abstract
Word spotting has gained increased attention lately as it can be used to extract textual information from handwritten documents and scene-text images. Current word spotting approaches are designed to work on a single language and/or script. Building intelligent models that learn script-independent multilingual word-spotting is challenging due to the large variability of multilingual alphabets and symbols. We used ResNet-152 and the Pyramidal Histogram of Characters (PHOC) embedding to build a one-model script-independent multilingual word-spotting and we tested it on Latin, Arabic, and Bangla (Indian) languages. The one-model we propose performs on par with the multi-model language-specific word-spotting system, and thus, reduces the number of models needed for each script and/or language.
Year
DOI
Venue
2019
10.1109/ICDAR.2019.00050
2019 International Conference on Document Analysis and Recognition (ICDAR)
Keywords
Field
DocType
Scene text images, Handwriting, Histogram of Characters, PHOC, Multitasking, ResNet
Histogram,Embedding,Pattern recognition,Handwriting,Computer science,Textual information,Bengali,Natural language processing,Artificial intelligence,Deep learning,Human multitasking,Spotting
Conference
ISSN
ISBN
Citations 
1520-5363
978-1-7281-3015-6
0
PageRank 
References 
Authors
0.34
14
3
Name
Order
Citations
PageRank
Mohammed Al-Rawi100.34
Ernest Valveny264741.65
Dimosthenis Karatzas340638.13