Document Collection Visual Question Answering - Citegraph

Paper Info

Title
Document Collection Visual Question Answering

Abstract
Current tasks and methods in Document Understanding aims to process documents as single elements. However, documents are usually organized in collections (historical records, purchase invoices), that provide context useful for their interpretation. To address this problem, we introduce Document Collection Visual Question Answering (DocCVQA) a new dataset and related task, where questions are posed over a whole collection of document images and the goal is not only to provide the answer to the given question, but also to retrieve the set of documents that contain the information needed to infer the answer. Along with the dataset we propose a new evaluation metric and baselines which provide further insights to the new dataset and task.

Year	DOI	Venue
2021	10.1007/978-3-030-86331-9_50	DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II
Keywords	DocType	Volume
Document collection, Visual Question Answering	Conference	12822
ISSN	Citations	PageRank
0302-9743	0	0.34
References	Authors
0	3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Ruben Tito	1	10	2.18
Dimosthenis Karatzas	2	406	38.13
E. Valveny	3	196	11.82

1