Abstract | ||
---|---|---|
A configurable archive document image analysis system for digital library construction has been designed using rapid prototyping and top-down iterative development methods. This approach has been found to be essential to capture the curators' expertise about existing card archive structures, content and databases. The design currently achieves about 93% correct segmentation of the required archive card fields overall, with 81.3% of all archive cards in a test set of 2000 images having all fields correctly segmented and labeled. Analysis of errors in the test set indicates that heavily annotated cards and non-standard card formats comprise 5-10% of the overall archive, and a significant proportion of these are unlikely to be resolvable without curatorial intervention. |
Year | DOI | Venue |
---|---|---|
2003 | 10.1109/ICDAR.2003.1227715 | ICDAR-1 |
Keywords | Field | DocType |
digital library construction,user-assisted archive document image,overall archive,configurable archive document image,correct segmentation,archive card,that heavily-annotated card,expertise about existing card archive,required archive card fields overall,non-standard card formats comprise,content and databases,top down,digital library,graphical user interfaces,digital libraries,image segmentation | Rapid prototyping,Information retrieval,Iterative and incremental development,Segmentation,Computer science,Image segmentation,Digital image,Graphical user interface,Digital library,Digital image processing,Multimedia | Conference |
ISSN | ISBN | Citations |
1520-5363 | 0-7695-1960-1 | 12 |
PageRank | References | Authors |
1.26 | 3 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
He, J. | 1 | 63 | 4.47 |
Andy C. Downton | 2 | 121 | 31.49 |