Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction. - Citegraph

Paper Info

Title
Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction.

Abstract
Multimodal named entity recognition and relation extraction (MNER and MRE) is a fundamental and crucial branch in information extraction. However, existing approaches for MNER and MRE usually suffer from error sensitivity when irrelevant object images incorporated in texts. To deal with these issues, we propose a novel Hierarchical Visual Prefix fusion NeTwork (HVPNeT) for visual-enhanced entity and relation extraction, aiming to achieve more effective and robust performance. Specifically, we regard visual representation as pluggable visual prefix to guide the textual representation for error insensitive forecasting decision. We further propose a dynamic gated aggregation strategy to achieve hierarchical multi-scaled visual features as visual prefix for fusion. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our method, and achieve state-of-the-art performance. Code is available in https://github.com/zjunlp/HVPNeT.

Year	DOI	Venue
2022	10.18653/v1/2022.findings-naacl.121	The Annual Conference of the North American Chapter of the Association for Computational Linguistics
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	9

Authors (9 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Xiang Chen	1	46	4.34
Ningyu Zhang	2	63	18.56
Li, Lei	3	799	69.54
Yunzhi Yao	4	0	1.01
Shumin Deng	5	32	10.61
Chuanqi Tan	6	29	9.25
Fei Huang	7	2	7.54
Luo Si	8	2498	169.52
Huanhuan Chen	9	731	101.79

1