Zero-Shot Learning Based Approach For Medieval Word Recognition using Deep-Learned Features - Citegraph

Paper Info

Title
Zero-Shot Learning Based Approach For Medieval Word Recognition using Deep-Learned Features

Abstract
Historical manuscripts reflect our past. Recently digitization of large quantities of historical handwritten documents is taking place in every corner of the world, and are being archived. From those digital repositories, automatic text indexing and retrieval system fetch only those documents to an end user that they are interested in. A regular OCR technology is not capable of rendering this service to an end user in a reliable manner. Instead, a word recognition/spotting algorithm performs the task. Word recognition based systems require enough labelled data per class to train the system. Moreover, all word classes need to be taught beforehand. Though word spotting could evade this drawback of prior training, these systems often need to have additional overheads like a language model to deal with "out of lexicon" words. Zero-shot learning could be a possible alternative to counter such situation. A Zero-shot learning algorithm is capable of handling unseen classes, provided the algorithm has been fortified with rich discriminating features and reliable "attribute description" per class during training. Since deeply learned features have enough discriminating power, a deep learning framework has been used here for feature extraction purpose. To the best of our knowledge, this is probably the first work on "out of lexicon" medieval word recognition using a Zero-Shot Learning framework. We obtained very encouraging results(accuracy ≈57% for "out of lexicon" classes) while dealing with 166 training classes and 50 unseen test classes.

Year	DOI	Venue
2018	10.1109/ICFHR-2018.2018.00067	2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)
Keywords	Field	DocType
Zero shot learning for word recogntion,Out of lexicon word recognition	Digitization,Computer science,Word recognition,Search engine indexing,Feature extraction,Lexicon,Natural language processing,Artificial intelligence,Deep learning,Rendering (computer graphics),Language model,Machine learning	Conference
ISSN	ISBN	Citations
2167-6445	978-1-5386-5876-5	1
PageRank	References	Authors
0.36	4	6

Authors (6 rows)

Cited by (1 rows)

References (4 rows)

Name	Order	Citations	PageRank
Sukalpa Chanda	1	93	11.20
Jochem Baas	2	1	0.36
Daniel Haitink	3	1	0.36
Sebastien Hamel	4	2	1.42
Dominique Stutzmann	5	3	3.45
Lambert Schomaker Member	6	1309	87.50

1