Hierarchical Multimodal Attention for End-to-End Audio-Visual Scene-Aware Dialogue Response Generation - Citegraph

Paper Info

Title
Hierarchical Multimodal Attention for End-to-End Audio-Visual Scene-Aware Dialogue Response Generation

Abstract
• Hierarchical attention, including question self-attention and question-guided attention on the input, helps improve model performance.
• Features of multiple modalities can be fused through nonlinear approaches to obtain better contextual representations.
• Input video length, question types in user queries, and turn positions affect the quality of the generated responses.

Year	2020
DOI	10.1016/j.csl.2020.101095
Venue	Computer Speech & Language
Keywords	Dialogue system, Audio-visual scene-aware dialogue, Neural network, Multimodal attention, Response generation
DocType	Journal
Volume	63
ISSN	0885-2308
Citations	0
PageRank	0.34
References	0
Authors	4

Authors (4 rows)

Name	Order	Citations	PageRank
Hung Le	1	3	3.09
Doyen Sahoo	2	83	9.94
Nancy F. Chen	3	120	28.98
Steven C. H. Hoi	4	3830	174.61
