Title
Text region extraction and text segmentation on camera-captured document style images
Abstract
In this paper, we propose a text extraction method from camera-captured document style images and propose a text segmentation method based on a color clustering method. The proposed extraction method detects text regions from the images using two low-level image features and verifies the regions through a high-level text stroke feature. The two level features are combined hierarchically. The low-level features are intensity variation and color variance. And, we use text strokes as a high-level feature using multi-resolution wavelet transforms on local image areas. The stroke feature vector is an input to a SVM (support vector machine) for verification, when needed. The proposed text segmentation method uses color clustering to the extracted text regions. We improved K-means clustering method and it selects K and initial seed values automatically. We tested the proposed methods with various document style images captured by three different cameras. We confirmed that the extraction rates are good enough to be used in real-life applications.
Year
DOI
Venue
2005
10.1109/ICDAR.2005.234
international conference on document analysis and recognition
Keywords
Field
DocType
character recognition,document image processing,feature extraction,image colour analysis,image segmentation,pattern clustering,support vector machines,text analysis,wavelet transforms,k-means clustering method,camera-captured document style images,color clustering method,image feature extraction,multiresolution wavelet transform,support vector machine,text region extraction,text segmentation,text stroke feature,image features,feature vector,k means clustering,wavelet transform
Computer vision,Feature vector,Text mining,Pattern recognition,Computer science,Feature (computer vision),Support vector machine,Image segmentation,Text segmentation,Feature extraction,Artificial intelligence,Cluster analysis
Conference
ISSN
ISBN
Citations 
1520-5263
0-7695-2420-6
7
PageRank 
References 
Authors
0.68
9
8
Name
Order
Citations
PageRank
yee jiun song170.68
ki chul kim270.68
Y. W. Choi3554.44
Hyeran Byun450565.97
Hyun Soo Kim56312.32
Suyoung Chi65712.88
D. -K. Jang791.14
Yun Koo Chung8453.87