Title |
---|
End-To-End Audio Visual Scene-Aware Dialog Using Multimodal Attention-Based Video Features |
Abstract |
---|
In order for machines interacting with the real world to have conversations with users about the objects and events around them, they need to understand dynamic audiovisual scenes. The recent revolution of neural network models allows us to combine various modules into a single end-to-end differentiable network. As a result, Audio Visual Scene-Aware Dialog (AVSD) systems for real-world applications can be developed by integrating state-of-the-art technologies from multiple research areas, including end-to-end dialog technologies, visual question answering (VQA) technologies, and video description technologies. In this paper, we introduce a new data set of dialogs about videos of human behaviors, as well as an end-to-end Audio Visual Scene-Aware Dialog (AVSD) model, trained using this new data set, that generates responses in a dialog about a video. By using features that were developed for multimodal attention-based video description, our system improves the quality of generated dialog about dynamic video scenes. |
Year | Venue | Keywords |
---|---|---|
2018 | 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | Audio visual scene-aware dialog, Visual QA, Video description, End-to-end modeling |
Field | DocType | Volume |
---|---|---|
Dialog box, Mel-frequency cepstrum, Question answering, Pattern recognition, Computer science, Visualization, Speech recognition, Feature extraction, Human behavior, Artificial intelligence, Artificial neural network, Encoding (memory) | Journal | abs/1806.08409 |
ISSN | Citations | PageRank |
---|---|---|
1520-6149 | 5 | 0.46 |
References | Authors |
---|---|
8 | 13 |
Name | Order | Citations | PageRank |
---|---|---|---|
Chiori Hori | 1 | 439 | 61.06 |
Huda AlAmri | 2 | 8 | 1.21 |
Jue Wang | 3 | 18 | 3.78 |
Gordon Wichern | 4 | 93 | 14.97 |
Takaaki Hori | 5 | 408 | 45.58 |
Anoop Cherian | 6 | 231 | 20.90 |
Tim K. Marks | 7 | 281 | 19.41 |
Vincent Cartillier | 8 | 7 | 1.22 |
Raphael Gontijo Lopes | 9 | 18 | 3.34 |
Abhishek Das | 10 | 433 | 23.54 |
Irfan A. Essa | 11 | 4876 | 580.85 |
Dhruv Batra | 12 | 2142 | 104.81 |
Devi Parikh | 13 | 2929 | 132.01 |