Abstract | ||
---|---|---|
In the paper, we propose a method based on transfer learning to detect multi-lingual text in natural scenes. First, a semantic segmentation map of the input image is obtained through a fully convolution network (FCN). In this map, each pixel is classified to text or none-text. And then, the candidate boxes of text regions are computed based on the map. In this procedure, VGG network is trained to obtain a basic character classifier of single language. Based on this VGG model, FCN has the ability to classify each pixel to text or none-text for multi-lingual with doing transfer learning. Finally, the bounding boxes of text are carry out by filtering the unsatisfied candidates with some rules. The experimental results show that our method achieves good performance on the task of multi-lingual text detection. And compared with other advanced method, the time cost of our method is shortest. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1007/978-3-319-77380-3_40 | ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I |
Keywords | Field | DocType |
Sence text detection,Multi-language,Transfer learning,Fully convolution networks | Computer vision,Pattern recognition,Segmentation,Convolution,Computer science,Transfer of learning,Filter (signal processing),Pixel,Artificial intelligence,Classifier (linguistics),Text detection,Bounding overwatch | Conference |
Volume | ISSN | Citations |
10735 | 0302-9743 | 0 |
PageRank | References | Authors |
0.34 | 16 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shaohua Liu | 1 | 2 | 1.39 |
Yan Shang | 2 | 0 | 0.34 |
Jizhong Han | 3 | 355 | 54.72 |
Xi Wang | 4 | 0 | 1.69 |
Hongchao Gao | 5 | 0 | 2.70 |
Dongqin Liu | 6 | 0 | 0.34 |