Title
Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter
Abstract
ABSTRACTIn this paper, we propose a new task of deep interactive video inpainting and an application for users to interact with machines. To our best knowledge, this is the first deep learning-based interactive video inpainting framework that only uses a free form of user input as guidance (i.e. scribbles) instead of mask annotations, which has academic, entertainment, and commercial value. With users' scribbles on a certain frame, it simultaneously performs interactive video object segmentation and video inpainting throughout the whole video. To achieve this, we utilize a shared spatial-temporal memory module, which combines both segmentation and inpainting into an end-to-end pipeline. In our framework, the past frames with object masks (either the users' scribbles or the predicted masks) constitute an external memory, and the current frame as the query is segmented and inpainted by reading the visual cues stored in that memory. Furthermore, our method allows users to iteratively refine the segmentation results, which effectively improves the inpainting performance with frames where inferior segmentation results are witnessed. Hence, one could obtain high-quality video inpainting results even with challenging video sequences. Qualitative and quantitative experimental results demonstrate the superiority of our approach.
Year
DOI
Venue
2021
10.1145/3474085.3475262
International Multimedia Conference
DocType
Citations 
PageRank 
Conference
0
0.34
References 
Authors
0
8
Name
Order
Citations
PageRank
Cheng Chen100.34
Jiayin Cai200.68
Yao Hu321616.71
Xu Tang42210.14
Xinggang Wang572848.02
Chun Yuan600.34
Xiang Bai73517149.87
Song Bai853333.91