Title
Using codebooks of fragmented connected-component contours in forensic and historic writer identification
Abstract
Recent advances in 'off-line' writer identification allow for new applications in handwritten text retrieval from archives of scanned historical documents. This paper describes new algorithms for forensic or historical writer identification, using the contours of fragmented connected-components in free-style handwriting. The writer is considered to be characterized by a stochastic pattern generator, producing a family of character fragments (fraglets). Using a codebook of such fraglets from an independent training set, the probability distribution of fraglet contours was computed for an independent test set. Results revealed a high sensitivity of the fraglet histogram in identifying individual writers on the basis of a paragraph of text. Large-scale experiments on the optimal size of Kohonen maps of fraglet contours were performed, showing usable classification rates within a non-critical range of Kohonen map dimensions. The proposed automatic approach bridges the gap between image-statistics approaches and purely knowledge-based manual character-based methods.
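The codebook-and-histogram idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each fraglet contour has already been resampled to a fixed-length vector, that the codebook is given as an array of such vectors (the paper trains it as a Kohonen map, which is not reproduced here), and that writers are compared with a chi-square distance between histograms, a choice the abstract does not specify. All function names are hypothetical.

```python
import numpy as np


def fraglet_histogram(fraglets, codebook):
    """Normalized histogram of nearest-codebook-entry occurrences.

    fraglets: (n_fraglets, dim) array of contour vectors (assumed format).
    codebook: (n_entries, dim) array of prototype contours (assumed given).
    """
    # Squared Euclidean distance from every fraglet to every codebook entry.
    dists = ((fraglets[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    nearest = dists.argmin(axis=1)
    counts = np.bincount(nearest, minlength=len(codebook)).astype(float)
    return counts / counts.sum()


def chi_square_distance(p, q, eps=1e-12):
    """Chi-square distance between two histograms (an assumed choice)."""
    return 0.5 * np.sum((p - q) ** 2 / (p + q + eps))


def identify_writer(query_hist, reference_hists):
    """Return the writer id whose reference histogram is closest to the query."""
    return min(
        reference_hists,
        key=lambda w: chi_square_distance(query_hist, reference_hists[w]),
    )


if __name__ == "__main__":
    # Synthetic demonstration only: two made-up writers who favour
    # different halves of a random 16-entry codebook.
    rng = np.random.default_rng(0)
    codebook = rng.normal(size=(16, 8))

    def sample(entries, n):
        idx = rng.choice(entries, size=n)
        return codebook[idx] + 0.01 * rng.normal(size=(n, 8))

    refs = {
        "writer_A": fraglet_histogram(sample(np.arange(0, 8), 200), codebook),
        "writer_B": fraglet_histogram(sample(np.arange(8, 16), 200), codebook),
    }
    query = fraglet_histogram(sample(np.arange(0, 8), 50), codebook)
    print(identify_writer(query, refs))
```

The histogram plays the role of the writer's fraglet probability distribution. Nearest-neighbour matching on it is one simple way to turn that distribution into an identification decision.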
Year: 2007
DOI: 10.1016/j.patrec.2006.08.005
Venue: Pattern Recognition Letters
Keywords: cursive-script segmentation, independent test set, fragmented connected-component contour, writer identification, Kohonen map dimension, author identification, fraglet contour, historical writer identification, handwritten text retrieval, historic writer identification, fraglet histogram, independent training set, Kohonen map, individual writer, connected component, knowledge base, probability distribution
Field: Histogram, Computer vision, Pattern recognition, Handwriting, Segmentation, Self-organizing map, Probability distribution, Connected component, Artificial intelligence, Mathematics, Codebook, Test set
DocType: Journal
Volume: 28
Issue: 6
Journal: Pattern Recognition Letters
Citations: 39
PageRank: 1.53
References: 13
Authors: 3
Name | Order | Citations | PageRank
Lambert Schomaker | 1 | 1309 | 87.50
Katrin Franke | 2 | 536 | 52.77
Marius Bulacu | 3 | 514 | 24.17