Visual saliency and terminology extraction for document annotation - Citegraph

Paper Info

Title
Visual saliency and terminology extraction for document annotation

Abstract
The document digitization process becomes a crucial economical issue in our society. Then, it becomes necessary to be able to organize this huge amount of documents. The work proposed in this paper tends to propose a new method to automatically classify document using a saliency-based segmentation process on one hand, and a terminology extraction and annotation on the other hand. The saliency-based segmentation is used to extract salient regions and by the way logo, while the terminology approach is used to annotate them and to automatically classify the document. The approach does not require human expertise, and use Google Images as a knowledge database. The results obtained on a real database of 1766 documents show the relevance of the approach.

Year	DOI	Venue
2013	10.1145/2494266.2494299	ACM Symposium on Document Engineering
Keywords	DocType	Citations
real database,visual saliency,terminology extraction,document annotation,crucial economical issue,document digitization process,huge amount,saliency-based segmentation process,saliency-based segmentation,knowledge database,google images,terminology approach	Conference	0
PageRank	References	Authors
0.34	6	4

Authors (4 rows)

Cited by (0 rows)

References (6 rows)

Name	Order	Citations	PageRank
Benjamin Duthil	1	8	3.69
Mickael Coustaty	2	41	3.88
Vincent Courboulay	3	66	12.07
Jean-Marc Ogier	4	631	85.80

1