Title
AI-VQA: Visual Question Answering based on Agent Interaction with Interpretability
Abstract
Visual Question Answering (VQA) serves as a proxy for evaluating the scene understanding of an intelligent agent by answering questions about images. Most VQA benchmarks to date focus on questions that can be answered by understanding the visual content of the scene, such as simple counting and visual attributes, as well as slightly more challenging questions that require extra encyclopedic knowledge. However, humans have a remarkable capacity to reason about dynamic interactions in a scene, which goes beyond the literal content of an image and has not been investigated so far. In this paper, we propose Agent Interaction Visual Question Answering (AI-VQA), a task that investigates deep scene understanding when an agent takes a certain action. For this task, a model not only needs to answer action-related questions but also to locate the objects with which the interaction occurs, to guarantee that it truly comprehends the action. Accordingly, we build a new dataset based on Visual Genome and the ATOMIC knowledge graph, including more than 19,000 manually annotated questions, and will make it publicly available. We also provide an annotation of the reasoning path followed to derive the answer for each question. Based on the dataset, we further propose a novel method, called ARE, that can comprehend the interaction and explain the reasoning based on a given event knowledge base. Experimental results show that our proposed method outperforms the baselines by a clear margin.
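To make the task format concrete, the following is a minimal, hypothetical sketch (not the released dataset schema or the ARE implementation) of what a single AI-VQA example and its evaluation signal might look like: an action-related question about an image, a textual answer, a grounded object region involved in the interaction, and a reasoning path over an event knowledge base such as ATOMIC. All field names and the IoU threshold are illustrative assumptions.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


# Hypothetical record layout for one AI-VQA example; field names are
# illustrative assumptions, not the dataset's actual schema.
@dataclass
class AIVQAExample:
    image_id: str                             # Visual Genome image identifier
    question: str                             # action-related question about the scene
    answer: str                               # expected textual answer
    grounded_box: Tuple[int, int, int, int]   # (x, y, w, h) of the interacted object
    reasoning_path: List[str] = field(default_factory=list)  # ATOMIC-style event chain


def iou(a, b):
    """Intersection-over-union of two (x, y, w, h) boxes, used to score grounding."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    ix = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    iy = max(0, min(ay + ah, by + bh) - max(ay, by))
    inter = ix * iy
    union = aw * ah + bw * bh - inter
    return inter / union if union else 0.0


# A prediction is credited only if it both answers correctly and localizes the
# object involved in the interaction (the 0.5 IoU threshold is an assumption).
def is_correct(pred_answer, pred_box, gold: AIVQAExample, iou_thresh=0.5):
    return (pred_answer.strip().lower() == gold.answer.strip().lower()
            and iou(pred_box, gold.grounded_box) >= iou_thresh)
```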
Year: 2022
DOI: 10.1145/3503161.3548387
Venue: International Multimedia Conference
DocType: Conference
Citations: 0
PageRank: 0.34
References: 0
Authors: 9
Name            Order   Citations   PageRank
Rengang Li      1       3           5.46
Cong Xu         2       0           0.34
Zhenhua Guo     3       0           0.34
Baoyu Fan       4       3           2.75
Runze Zhang     5       0           0.34
Wei Liu         6       112         1.81
Yaqian Zhao     7       0           0.34
Weifeng Gong    8       0           0.34
Endong Wang     9       7           5.62