Title
An annotation assistance system using an unsupervised codebook composed of handwritten graphical multi-stroke symbols
Abstract
Many present recognition systems take advantage of ground-truthed datasets for training, evaluating and testing. But the creation of ground-truthed datasets is a tedious task. This paper proposes an iterative unsupervised handwritten graphical symbols learning framework which can be used for assisting such a labeling task. Initializing each stroke as a segment, we construct a relational graph between the segments where the nodes are the segments and the edges are the spatial relations between them. To extract the relevant patterns, a quantization of segments and spatial relations is implemented. Discovering graphical symbols becomes then the problem of finding the sub-graphs according to the Minimum Description Length (MDL) principle. The discovered graphical symbols will become the new segments for the next iteration. In each iteration, the quantization of segments yields the codebook in which the user can label graphical symbols. This original method has been first applied on a dataset of simple mathematical expressions. The results reported in this work show that only 58.2% of the strokes have to be manually labeled.
Year
DOI
Venue
2014
10.1016/j.patrec.2012.11.018
Pattern Recognition Letters
Keywords
Field
DocType
annotation assistance system,segments yield,minimum description length,original method,graphical symbol,ground-truthed datasets,tedious task,spatial relation,handwritten graphical multi-stroke symbol,handwritten graphical symbol,next iteration,unsupervised codebook,new segment,spatial relations
Spatial relation,Graph,Annotation,Expression (mathematics),Pattern recognition,Computer science,Minimum description length,Artificial intelligence,Initialization,Quantization (signal processing),Codebook
Journal
Volume
ISSN
Citations 
35,
0167-8655
4
PageRank 
References 
Authors
0.43
17
3
Name
Order
Citations
PageRank
Jinpeng Li1203.81
Harold Mouchère210714.46
Christian Viard-Gaudin344446.20