Title
Building Compact CNN-DBLSTM Based Character Models for Handwriting Recognition and OCR by Teacher-Student Learning
Abstract
Character models based on convolutional neural network (CNN) and deep bidirectional long short-term memory (DBLSTM) have achieved high recognition accuracy on various handwriting recognition (HWR) and OCR tasks. To deploy CNN-DBLSTM models in products, it is necessary to reduce the footprint and runtime latency as much as possible. In this paper, we use a teacher-student learning approach to achieve this goal, where a new objective function is proposed to match the extracted CNN feature sequences of the teacher and student models under the guidance of the succeeding LSTM layer. Experimental results on large scale English HWR and OCR tasks show that the learned small student model can achieve about 14.6x footprint reduction and 9.6x speedup without recognition accuracy degradation against the big teacher model.
Year
DOI
Venue
2018
10.1109/ICFHR-2018.2018.00033
2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)
Keywords
Field
DocType
Teacher Student Learning,CNN DBLSTM,OCR,HWR
Convolutional neural network,Computer science,Latency (engineering),Handwriting recognition,Footprint,Artificial intelligence,Machine learning,Speedup,Student learning
Conference
ISSN
ISBN
Citations 
2167-6445
978-1-5386-5876-5
0
PageRank 
References 
Authors
0.34
10
5
Name
Order
Citations
PageRank
Haisong Ding192.57
Kai Chen2715.38
Wenping Hu3826.77
Meng Cai4688.24
Qiang Huo5109899.69