Title
Learning to Segment Instances in Videos with Spatial Propagation Network.
Abstract
We propose a deep learning-based framework for instance-level object segmentation. Our method mainly consists of three steps. First, We train a generic model based on ResNet-101 for foreground/background segmentations. Second, based on this generic model, we fine-tune it to learn instance-level models and segment individual objects by using augmented object annotations in first frames of test videos. To distinguish different instances in the same video, we compute a pixel-level score map for each object from these instance-level models. Each score map indicates the objectness likelihood and is only computed within the foreground mask obtained in the first step. To further refine this per frame score map, we learn a spatial propagation network. This network aims to learn how to propagate a coarse segmentation mask spatially based on the pairwise similarities in each frame. In addition, we apply a filter on the refined score map that aims to recognize the best connected region using spatial and temporal consistencies in the video. Finally, we decide the instance-level object segmentation in each video by comparing score maps of different instances.
Year
Venue
Field
2017
arXiv: Computer Vision and Pattern Recognition
Pairwise comparison,Pattern recognition,Segmentation,Computer science,Artificial intelligence,Deep learning,Machine learning
DocType
Volume
Citations 
Journal
abs/1709.04609
2
PageRank 
References 
Authors
0.38
17
9
Name
Order
Citations
PageRank
Jingchun Cheng1403.32
Sifei Liu222717.54
Yi-Hsuan Tsai313818.08
Wei-Chih Hung4293.84
Shalini Gupta529920.42
Jinwei Gu620.38
Jan Kautz73615198.77
Shengjin Wang8145078.26
Yang Ming-Hsuan915303620.69