Abstract | ||
---|---|---|
Transcription of handwritten text in (old) documents is an important, time-consuming task for digital libraries. In this paper, an efficient interactive-predictive transcription prototype called GIDOC (Gimp-based Interactive transcription of old text DOCuments) is presented. GIDOC is a first attempt to provide integrated support for interactive-predictive page layout analysis, text line detection, and handwritten text transcription. It is based on GIMP and uses advanced techniques and tools for language and handwritten text modelling. Results are reported for a real transcription task on a 764-page Spanish manuscript from 1891. |
Year | Venue | Keywords |
---|---|---|
2010 | Pattern Recognition in Information Systems | natural language processing |
Field | DocType | Citations |
---|---|---|
Computer science, Page layout analysis, Natural language processing, Artificial intelligence, Digital library, Machine learning | Conference | 4 |
PageRank | References | Authors |
---|---|---|
0.47 | 9 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nicolás Serrano | 1 | 277 | 32.84 |
Lionel Tarazón | 2 | 9 | 1.03 |
Daniel Pérez | 3 | 97 | 8.91 |
Oriol Ramos Terrades | 4 | 150 | 18.36 |
Alfons Juan | 5 | 572 | 61.45 |