Title
Active Learning in Handwritten Text Recognition using the Derivational Entropy
Abstract
Handwritten Text Recognition systems are based on statistical models such as recurrent neural networks or hidden Markov models for optical modeling of characters. These models need large corpora for training, consisting in text line images with their corresponding transcripts. The manual annotation of this training data is expensive because it is carried out by experts in paleography, who are specialized in reading ancient scripts. An alternative to reduce the annotation human effort is to use Active Learning techniques to selecting the most informative samples to be used for training. In this paper we study an Active Learning technique to selecting the most informative samples in an HTR scenario. The expert paleographer transcribes only the most informative samples in each stage. The technique followed here is based in the derivational entropy computed from word-graphs obtained from the recognition process.
Year
DOI
Venue
2018
10.1109/ICFHR-2018.2018.00058
2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)
Keywords
Field
DocType
Handwritten Text Recognition,Derviational Entropy,Word Graph Normalization
Annotation,Active learning,Computer science,Recurrent neural network,Statistical model,Artificial intelligence,Hidden Markov model,Text recognition,Machine learning,Scripting language,Optical modeling
Conference
ISSN
ISBN
Citations 
2167-6445
978-1-5386-5876-5
0
PageRank 
References 
Authors
0.34
11
3
Name
Order
Citations
PageRank
Verónica Romero-Gomez101.01
Joan-Andreu Sánchez219829.00
Alejandre Héctor Toselli300.34