Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter - Citegraph

Paper Info

Title
Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter

Abstract
ABSTRACTIn this paper, we propose a new task of deep interactive video inpainting and an application for users to interact with machines. To our best knowledge, this is the first deep learning-based interactive video inpainting framework that only uses a free form of user input as guidance (i.e. scribbles) instead of mask annotations, which has academic, entertainment, and commercial value. With users' scribbles on a certain frame, it simultaneously performs interactive video object segmentation and video inpainting throughout the whole video. To achieve this, we utilize a shared spatial-temporal memory module, which combines both segmentation and inpainting into an end-to-end pipeline. In our framework, the past frames with object masks (either the users' scribbles or the predicted masks) constitute an external memory, and the current frame as the query is segmented and inpainted by reading the visual cues stored in that memory. Furthermore, our method allows users to iteratively refine the segmentation results, which effectively improves the inpainting performance with frames where inferior segmentation results are witnessed. Hence, one could obtain high-quality video inpainting results even with challenging video sequences. Qualitative and quantitative experimental results demonstrate the superiority of our approach.

Year	DOI	Venue
2021	10.1145/3474085.3475262	International Multimedia Conference
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	8

Authors (8 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Cheng Chen	1	0	0.34
Jiayin Cai	2	0	0.68
Yao Hu	3	216	16.71
Xu Tang	4	22	10.14
Xinggang Wang	5	728	48.02
Chun Yuan	6	0	0.34
Xiang Bai	7	3517	149.87
Song Bai	8	533	33.91

1