Title
Appearance and shape based image synthesis by conditional variational generative adversarial network
Abstract
Person image synthesis based on shape and appearance using deep generative models opens the door in mickle applications, such as person re-identification (ReID) and movie industry. The methods of image synthesis are driven by producing the image of an object directly, which fail to recover spatial deformations when images are generated. In this paper, we present a conditional variational generative adversarial network (CVGAN) to synthesize desired images guided by target shape by modeling the inherent interplay between shape and appearance. Firstly, the shape and appearance of the given images are disentangled by adopting variational inference, which enables us to generate person images with arbitrary shapes. Secondly, to preserve the details and generate photo-realistic images, the Kullback–Leibler (KL) loss is adopted to reduce the gap between the condition image and generated image. Thirdly, to prevent partly gradient vanishing problem for training our framework stably, we propose combined general learning method, where the discriminative network leverages least squares loss. In addition, we experiment on COCO, DeepFashion and Market-1501 datasets, and results demonstrate that VGAN significantly improves the synthesis of images on discriminability, diversity and quality over the existing methods.
Year
DOI
Venue
2020
10.1016/j.knosys.2019.105450
Knowledge-Based Systems
Keywords
Field
DocType
Image synthesis,Deep generative models,Variational inference,Generative adversarial network
Least squares,Generative adversarial network,Pattern recognition,Computer science,Inference,Image synthesis,Artificial intelligence,Generative grammar,Discriminative model,Machine learning
Journal
Volume
ISSN
Citations 
193
0950-7051
0
PageRank 
References 
Authors
0.34
0
7
Name
Order
Citations
PageRank
Ying Chen111516.65
Shixiong Xia210213.28
Jiaqi Zhao311715.77
Yong Zhou46112.72
Qiang Niu584.59
Rui Yao6969.89
Dongjun Zhu742.50