Visual Storytelling. - Citegraph

Paper Info

Title
Visual Storytelling.

Abstract
We introduce the first dataset for sequential vision-to-language, and explore how this data may be used for the task of visual storytelling. The first release of this dataset, SIND v.1, includes 81,743 unique photos in 20,211 sequences, aligned to both descriptive (caption) and story language. We establish several strong baselines for the storytelling task, and motivate an automatic metric to benchmark progress. Modelling concrete description as well as figurative and social language, as provided in this dataset and the storytelling task, has the potential to move artificial intelligence from basic understandings of typical visual scenes towards more and more human-like understanding of grounded event structure and subjective expression.

Year	Venue	DocType
2016	HLT-NAACL	Conference
Volume	Citations	PageRank
abs/1604.03968	0	0.34
References	Authors
0	15

Authors (15 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Ting-Hao Huang	1	146	19.23
Francis Ferraro	2	14	4.98
Nasrin Mostafazadeh	3	86	7.26
Ishan Misra	4	201	12.69
Aishwarya Agrawal	5	360	10.62
Jacob Devlin	6	738	32.34
Ross B. Girshick	7	21921	927.22
Xiaodong He	8	3858	190.28
Pushmeet Kohli	9	7398	332.84
Dhruv Batra	10	2142	104.81
C. Lawrence Zitnick	11	7321	332.72
Devi Parikh	12	2929	132.01
Lucy Vanderwende	13	1051	79.54
Michel Galley	14	2154	96.04
Margaret Mitchell	15	1450	65.37

1