Title
One-Shot Learning-Based Animal Video Segmentation
Abstract
Deep learning-based video segmentation methods can offer a good performance after being trained on the large-scale pixel labeled datasets. However, a pixel-wise manual labeling of animal images is challenging and time consuming due to irregular contours and motion blur. To achieve desirable tradeoffs between the accuracy and speed, a novel one-shot learning-based approach is proposed in this article to segment animal video with only one labeled frame. The proposed approach consists of the following three main modules: guidance frame selection utilizes “BubbleNet” to choose one frame for manual labeling, which can leverage the fine-tuning effects of the only labeled frame; Xception-based fully convolutional network localizes dense prediction using depthwise separable convolutions based on one single labeled frame; and postprocessing is used to remove outliers and sharpen object contours, which consists of two submodules—test time augmentation and conditional random field. Extensive experiments have been conducted on the DAVIS 2016 animal dataset. Our proposed video segmentation approach achieved mean intersection-over-union score of 89.5% on the DAVIS 2016 animal dataset with less run time, and outperformed the state-of-art methods (OSVOS and OSMN). The proposed one-shot learning-based approach achieves real-time and automatic segmentation of animals with only one labeled video frame. This can be potentially used further as a baseline for intelligent perception-based monitoring of animals and other domain-specific applications. <xref ref-type="fn" rid="fn1" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><sup>1</sup></xref> <fn id="fn1" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><label><sup>1</sup></label><p>The source code, datasets, and pre-trained weights for this work are publicly [Online]. Available: <uri>https://github.com/tengfeixue-victor/One-Shot-Animal-Video-Segmentation</uri>.</p></fn>
Year
DOI
Venue
2022
10.1109/TII.2021.3117020
IEEE Transactions on Industrial Informatics
Keywords
DocType
Volume
Animal monitoring,convolutional neural network (CNN),deep learning,one-shot learning,video segmentation
Journal
18
Issue
ISSN
Citations 
6
1551-3203
0
PageRank 
References 
Authors
0.34
13
7
Name
Order
Citations
PageRank
Tengfei Xue100.34
Yongliang Qiao200.34
He Kong300.34
Daobilige Su400.34
Shirui Pan582069.37
Khalid Rafique600.34
Salah Sukkarieh71142141.84