Title
A Text-Line Segmentation Method for Historical Tibetan Documents Based on Baseline Detection
Abstract
Text-line segmentation is an important task in historical Tibetan document recognition. Historical Tibetan document images usually contain touching or overlapping characters between consecutive text-lines, which makes text-line segmentation difficult. In this paper, we present a text-line segmentation method based on baseline detection. The initial position of each line's baseline is obtained by template matching, pruning algorithms, and a closing operation. The baseline is then estimated by dynamic tracing over the pixel points of each line, using the context information between pixel points. Overlapping or touching areas are cut at the stroke of minimum width. Finally, text-lines are extracted based on the estimated baselines and the cut positions of the touching areas. The proposed algorithm has been evaluated on a dataset of historical Tibetan document images. Experimental results show the effectiveness of the proposed method.
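The abstract does not spell out how the minimum-width stroke is located, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes the touching area has already been cropped to a small binary grid (1 = ink) and treats the number of ink pixels in a row as the stroke width at that row. The function name `min_width_cut_row` and the sample grid are hypothetical.

```python
def min_width_cut_row(region):
    """Return the index of the row where the stroke is narrowest.

    `region` is a list of rows, each a list of 0/1 values (1 = ink),
    covering only the touching area between two text-lines.
    Returns None if the region contains no ink at all.
    """
    best_row, best_width = None, None
    for y, row in enumerate(region):
        width = sum(row)  # ink pixels in this row, used as the stroke width
        if width == 0:
            continue  # a blank row holds no stroke to cut; skip it
        if best_width is None or width < best_width:
            best_row, best_width = y, width
    return best_row


# Hypothetical touching area: a stroke that narrows in the third row.
touching = [
    [0, 1, 1, 1, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 1, 1, 1, 0],
]
cut_row = min_width_cut_row(touching)
```

Cutting at the narrowest row removes the fewest ink pixels, which is the usual motivation for a minimum-width criterion. A fuller version would measure the width of the connecting stroke only, rather than all ink in the row.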
Year: 2017
DOI: 10.1007/978-981-10-7299-4_29
Venue: Communications in Computer and Information Science
Keywords: Historical Tibetan document, Text-line segmentation, Baseline detection
DocType: Conference
Volume: 771
ISSN: 1865-0929
Citations: 0
PageRank: 0.34
References: 0
Authors: 4
Name | Order | Citations | PageRank
Yanxing Li | 1 | 0 | 0.34
Long-Long Ma | 2 | 0 | 0.68
Lijuan Duan | 3 | 215 | 26.13
Jian Wu | 4 | 9 | 4.12