Title | ||
---|---|---|
Detecting and locating partially specified keywords in scanned images using hidden Markov models |
Abstract | ||
---|---|---|
A hidden Markov model (HMM) based system for detecting locating, or spotting, user-specified keywords in scanned images is described. The system is font-independent, and no pre-segmentation of text and graphics is required. The bounding boxes of potential lines of text are extracted from the image using morphology. Feature vectors based on the external shape and internal structure of characters are computed for each bounding box. A keyword HMM is created by concatenating appropriate context-dependent character HMMs. The non-keyword HMM is based on context-dependent sub-character models. Keywords are spotted using Viterbi decoding on an HMM network created from the keyword and non-keyword HMMs. This model allows detection of keywords embedded in a line without pre-segmentation of the line into words or characters. Thus keywords may be specified by a baseform and variants of the keyword can be detected |
Year | DOI | Venue |
---|---|---|
1993 | 10.1109/ICDAR.1993.395765 | Tsukuba Science City |
Keywords | Field | DocType |
Viterbi decoding,feature extraction,hidden Markov models,word processing,Viterbi decoding,bounding box,bounding boxes,context-dependent character HMMs,context-dependent sub-character models,external shape,feature vectors,font-independent,hidden Markov model,internal structure,keyword HMM,morphology,partially specified keywords,scanned images,user-specified keywords | Graphics,Computer vision,Feature vector,Pattern recognition,Computer science,Image retrieval,Feature extraction,Image segmentation,Artificial intelligence,Hidden Markov model,Word processing,Minimum bounding box | Conference |
ISBN | Citations | PageRank |
0-8186-4960-7 | 26 | 2.72 |
References | Authors | |
9 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Francine Chen | 1 | 1218 | 153.96 |
Lynn Wilcox | 2 | 1330 | 180.16 |
Bloomberg, Dan S. | 3 | 39 | 4.43 |