Title
Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer
Abstract
We propose a semi-supervised network for wide-angle portraits correction. Wide-angle images often suffer from skew and distortion affected by perspective distortion, especially noticeable at the face regions. Previous deep learning based approaches need the ground-truth correction flow maps for training guidance. However, such labels are expensive, which can only be obtained manually. In this work, we design a semi-supervised scheme and build a high-quality unlabeled dataset with rich scenarios, allowing us to simultaneously use labeled and unlabeled data to improve performance. Specifically, our semi-supervised scheme takes advantage of the consistency mechanism, with several novel components such as direction and range consistency (DRC) and regression consistency (RC). Furthermore, different from the existing methods, we propose the Multi-Scale Swin-Unet (MS-Unet) based on the multi-scale swin transformer block (MSTB), which can simultaneously learn short-distance and long-distance information to avoid artifacts. Extensive experiments demonstrate that the proposed method is superior to the state-of-the-art methods and other representative baselines. The source code and dataset are available at https://github.corn/megvii-research/PortraitsCorrection
Year
DOI
Venue
2022
10.1109/CVPR52688.2022.01907
IEEE Conference on Computer Vision and Pattern Recognition
Keywords
DocType
Volume
Computational photography, Image and video synthesis and generation, Low-level vision
Conference
2022
Issue
Citations 
PageRank 
1
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Fushun Zhu100.34
Shan Zhao200.34
Peng Wang3385106.03
Hao Wang400.34
Hua Yan500.34
Shuaicheng Liu636328.26