Title
Progressive Attention Memory Network For Movie Story Question Answering
Abstract
This paper proposes the progressive attention memory network (PAMN) for movie story question answering (QA). Movie story QA is challenging compared to VQA in two aspects: (1) pinpointing the temporal parts relevant to answer the question is difficult as the movies are typically longer than an hour, (2) it has both video and subtitle where different questions require different modality to infer the answer. To overcome these challenges, PAMN involves three main features: (1) progressive attention mechanism that utilizes cues from both question and answer to progressively prune out irrelevant temporal parts in memory, (2) dynamic modality fusion that adaptively determines the contribution of each modality for answering the current question, and (3) belief correction answering scheme that successively corrects the prediction score on each candidate answer. Experiments on publicly available benchmark datasets, MovieQA and TVQA, demonstrate that each feature contributes to our movie story QA architecture, PAMN, and improves performance to achieve the state-of-the-art result. Qualitative analysis by visualizing the inference mechanism of PAMN is also provided.
Year
DOI
Venue
2019
10.1109/CVPR.2019.00853
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019)
Field
DocType
Volume
Architecture,Question answering,Computer science,Inference,Subtitle,Natural language processing,Artificial intelligence,Temporal parts,Machine learning
Journal
abs/1904.08607
ISSN
Citations 
PageRank 
1063-6919
5
0.40
References 
Authors
0
5
Name
Order
Citations
PageRank
Junyeong Kim162.77
Minuk Ma260.75
Kyungsu Kim361.08
Sungjin Kim415914.60
Chang D. Yoo537545.88