Title | ||
---|---|---|
An annotation assistance system using an unsupervised codebook composed of handwritten graphical multi-stroke symbols |
Abstract | ||
---|---|---|
Many present recognition systems take advantage of ground-truthed datasets for training, evaluating and testing. But the creation of ground-truthed datasets is a tedious task. This paper proposes an iterative unsupervised handwritten graphical symbols learning framework which can be used for assisting such a labeling task. Initializing each stroke as a segment, we construct a relational graph between the segments where the nodes are the segments and the edges are the spatial relations between them. To extract the relevant patterns, a quantization of segments and spatial relations is implemented. Discovering graphical symbols becomes then the problem of finding the sub-graphs according to the Minimum Description Length (MDL) principle. The discovered graphical symbols will become the new segments for the next iteration. In each iteration, the quantization of segments yields the codebook in which the user can label graphical symbols. This original method has been first applied on a dataset of simple mathematical expressions. The results reported in this work show that only 58.2% of the strokes have to be manually labeled. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1016/j.patrec.2012.11.018 | Pattern Recognition Letters |
Keywords | Field | DocType |
annotation assistance system,segments yield,minimum description length,original method,graphical symbol,ground-truthed datasets,tedious task,spatial relation,handwritten graphical multi-stroke symbol,handwritten graphical symbol,next iteration,unsupervised codebook,new segment,spatial relations | Spatial relation,Graph,Annotation,Expression (mathematics),Pattern recognition,Computer science,Minimum description length,Artificial intelligence,Initialization,Quantization (signal processing),Codebook | Journal |
Volume | ISSN | Citations |
35, | 0167-8655 | 4 |
PageRank | References | Authors |
0.43 | 17 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jinpeng Li | 1 | 20 | 3.81 |
Harold Mouchère | 2 | 107 | 14.46 |
Christian Viard-Gaudin | 3 | 444 | 46.20 |