Abstract | ||
---|---|---|
The prevalent scene text detection approach follows four sequential steps comprising character candidate detection, false character candidate removal, text line extraction, and text line verification. However, errors occur and accumulate throughout each of these sequential steps which often lead to low detection performance. To address these issues, we propose a unified scene text detection system, namely Text Flow, by utilizing the minimum cost (min-cost) flow network model. With character candidates detected by cascade boosting, the min-cost flow network model integrates the last three sequential steps into a single process which solves the error accumulation problem at both character level and text line level effectively. The proposed technique has been tested on three public datasets, i.e, ICDAR2011 dataset, ICDAR2013 dataset and a multilingual dataset and it outperforms the state-of-the-art methods on all three datasets with much higher recall and F-score. The good performance on the multilingual dataset shows that the proposed technique can be used for the detection of texts in different languages. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/ICCV.2015.528 | ICCV '15 Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV) |
Field | DocType | Volume |
Flow network,Pattern recognition,Computer science,Flow (psychology),Speech recognition,Artificial intelligence,Cascade,Boosting (machine learning),Text detection | Conference | abs/1604.06877 |
Issue | ISSN | Citations |
1 | 1550-5499 | 44 |
PageRank | References | Authors |
1.09 | 27 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shangxuan Tian | 1 | 194 | 7.17 |
Yifeng Pan | 2 | 44 | 1.09 |
Chang Huang | 3 | 1867 | 94.82 |
Shijian Lu | 4 | 1346 | 93.57 |
Yu, Kai | 5 | 4799 | 255.21 |
Chew L Tan | 6 | 57 | 3.96 |