Title | ||
---|---|---|
Visual Understanding and Narration: A Deeper Understanding and Explanation of Visual Scenes. |
Abstract | ||
---|---|---|
We describe the task of Visual Understanding and Narration, in which a robot (or agent) generates text for the images that it collects when navigating its environment, by answering open-ended questions, such as 'what happens, or might have happened, here?' |
Year | Venue | DocType |
---|---|---|
2019 | arXiv: Computation and Language | Journal |
Volume | Citations | PageRank |
abs/1906.00038 | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stephanie M. Lukin | 1 | 93 | 9.96 |
Claire Bonial | 2 | 232 | 18.02 |
Clare R. Voss | 3 | 344 | 29.51 |