Title
Optical modelling and language modelling trade-off for Handwritten Text Recognition
Abstract
Training the models needed for Automatic Handwritten Text Recognition of historical documents generally requires a significant amount of human effort. This is mainly due to the great differences that often exist between collections and to the lack of linguistic resources from the period when the documents were written, which results in a need of manual data labelling effort. This paper presents a study on the reuse of models trained with data from a different collection, focusing on the contribution that the language model and the optical models have on the performance. An empirical evaluation is performed using data from Jeremy Bentham manuscripts with the aim of recognising a manuscript about a very different topic written by Jane Austen.
Year
DOI
Venue
2015
10.1109/ICDAR.2015.7333878
International Conference on Document Analysis and Recognition
Keywords
Field
DocType
Handwritten Text Recognition, Optical Models, Language Models, Model Retraining
Reuse,Computer science,Natural language processing,Artificial intelligence,Language modelling,Text recognition,Language model
Conference
ISSN
Citations 
PageRank 
1520-5363
0
0.34
References 
Authors
9
3
Name
Order
Citations
PageRank
Mauricio Villegas121919.25
Joan-Andreu Sánchez219829.00
E. Vidal345449.15