EDNets: Deep Feature Learning for Document Image Classification Based on Multi-view Encoder-Decoder Neural Networks - Citegraph

Paper Info

Title
EDNets: Deep Feature Learning for Document Image Classification Based on Multi-view Encoder-Decoder Neural Networks

Abstract
In document analysis, text document images classification is a challenging task in several fields of application, such as archiving old documents, administrative procedures, or security. In this context, visual appearance has been widely used for document classification and considered as a useful and relevant features for the classification. However, visual information is insufficient to achieve higher classification rates, where relevant additional features, including textual features can be leveraged to improve classification results. In this paper, we propose a multi-view deep representation learning which allows combining textual and visual-based information respectively measured through the text and visual document images. The multi-view deep representation learning is designed to find a deeply shared representation between textual and visual features by fusing them into a joint latent space where a classifier model is trained to classify the document images. Our experimental results demonstrate the ability of the proposed model to outperform competitive approaches and to produce promising results.

Year	DOI	Venue
2021	10.1007/978-3-030-86337-1_22	DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV
Keywords	DocType	Volume
Document image classification, Multi-view representation learning, Deep learning	Conference	12824
ISSN	Citations	PageRank
0302-9743	0	0.34
References	Authors
0	2

Authors (2 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Sellami, A.	1	6	2.51
Salvatore Tabbone	2	653	52.52

1