Title
Zero-Shot Learning Based Approach For Medieval Word Recognition using Deep-Learned Features
Abstract
Historical manuscripts reflect our past. Recently digitization of large quantities of historical handwritten documents is taking place in every corner of the world, and are being archived. From those digital repositories, automatic text indexing and retrieval system fetch only those documents to an end user that they are interested in. A regular OCR technology is not capable of rendering this service to an end user in a reliable manner. Instead, a word recognition/spotting algorithm performs the task. Word recognition based systems require enough labelled data per class to train the system. Moreover, all word classes need to be taught beforehand. Though word spotting could evade this drawback of prior training, these systems often need to have additional overheads like a language model to deal with "out of lexicon" words. Zero-shot learning could be a possible alternative to counter such situation. A Zero-shot learning algorithm is capable of handling unseen classes, provided the algorithm has been fortified with rich discriminating features and reliable "attribute description" per class during training. Since deeply learned features have enough discriminating power, a deep learning framework has been used here for feature extraction purpose. To the best of our knowledge, this is probably the first work on "out of lexicon" medieval word recognition using a Zero-Shot Learning framework. We obtained very encouraging results(accuracy ≈57% for "out of lexicon" classes) while dealing with 166 training classes and 50 unseen test classes.
Year
DOI
Venue
2018
10.1109/ICFHR-2018.2018.00067
2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)
Keywords
Field
DocType
Zero shot learning for word recogntion,Out of lexicon word recognition
Digitization,Computer science,Word recognition,Search engine indexing,Feature extraction,Lexicon,Natural language processing,Artificial intelligence,Deep learning,Rendering (computer graphics),Language model,Machine learning
Conference
ISSN
ISBN
Citations 
2167-6445
978-1-5386-5876-5
1
PageRank 
References 
Authors
0.36
4
6
Name
Order
Citations
PageRank
Sukalpa Chanda19311.20
Jochem Baas210.36
Daniel Haitink310.36
Sebastien Hamel421.42
Dominique Stutzmann533.45
Lambert Schomaker Member6130987.50