Title
Exploring More Representative States Of Hidden Markov Model In Optical Character Recognition: A Clustering-Based Model Pre-Training Approach
Abstract
Hidden Markov Model (HMM) is an effective method to describe sequential signals in many applications. As to model estimation issue, common training algorithm only focuses on the optimization of model parameters. However, model structure influences system performance as well. Although some structure optimization methods are proposed, they are usually implemented as an independent module before parameter optimization. In this paper, the clustering feature of states in HMM is discussed through comparing the mechanism of Quadratic Discriminant Function (QDF) classifier and HMM. Then, through the clustering effect of Viterbi training and Baum-Welch training, a novel clustering-based model pre-training approach is proposed. It can optimize model parameters and model structure by turns, until the representative states of all models are explored. Finally, the proposed approach is evaluated on two typical OCR applications, printed and handwritten Arabic text line recognition. And it is compared with some other optimization methods. The improvement of character recognition performance proves the proposed approach can make more precise state allocation. And the representative states are benefit to HMM decoding.
Year
DOI
Venue
2015
10.1142/S0218001415500147
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE
Keywords
Field
DocType
Hidden Markov model, optical character recognition, model estimation, state number optimization
Pattern recognition,Effective method,Markov model,Computer science,Optical character recognition,Artificial intelligence,Decoding methods,Cluster analysis,Hidden Markov model,Classifier (linguistics),Viterbi algorithm,Machine learning
Journal
Volume
Issue
ISSN
29
3
0218-0014
Citations 
PageRank 
References 
1
0.36
4
Authors
4
Name
Order
Citations
PageRank
Zhiwei Jiang1416.41
Xiaoqing Ding21219108.02
Liangrui Peng38017.67
Changsong Liu435836.20