Abstract | ||
---|---|---|
Historical documents frequently suffer from arbitrary geometric distortions (warping and folds) due to storage conditions, use and to, some extent, the printing process of the time. In addition, page curl can be prominent due to the scanning technique used. Such distortions adversely affect OCR and print-on-demand quality. Previous approaches to geometric restoration either focus only on the correction of page curl or require supplementary information obtained by additional scanning hardware--not practical for existing scans. This paper presents a new approach to detect and restore arbitrary warping and folds, in addition to page curl. Warped text lines and the smooth deformation between them are precisely modelled as primary and secondary flow lines that are then restored to their original linear shape. Preliminary, but representative, experimental results, in comparison to a leading page curl removal method and an industry-standard commercial system, demonstrate the effectiveness of the proposed method. |
Year | DOI | Venue |
---|---|---|
2011 | 10.1109/ICDAR.2011.184 | ICDAR-1 |
Keywords | Field | DocType |
flow lines,removal method,arbitrarily warped historical document,leading page,geometric restoration,warped text line,historical document,industry-standard commercial system,arbitrary warping,arbitrary geometric distortion,optical character recognition,text analysis | Computer vision,Image warping,Computer science,Document image processing,Flow (psychology),Optical character recognition,Artificial intelligence,Curl (mathematics),Historical document | Conference |
ISSN | Citations | PageRank |
1520-5363 | 3 | 0.44 |
References | Authors | |
10 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Maryam Rahnemoonfar | 1 | 19 | 6.57 |
Apostolos Antonacopoulos | 2 | 378 | 36.45 |