Title
Improved Variational Autoencoders for Text Modeling using Dilated Convolutions
Abstract
Recent work on generative text modeling has found that variational autoencoders (VAE) with LSTM decoders perform worse than simpler LSTM language models (Bowman et al., 2015). This negative result is so far poorly understood, but has been attributed to the propensity of LSTM decoders to ignore conditioning information from the encoder. In this paper, we experiment with a new type of decoder for VAE: a dilated CNN. By changing the decoder's dilation architecture, we control the size of context from previously generated words. In experiments, we find that there is a trade-off between contextual capacity of the decoder and effective use of encoding information. We show that when carefully managed, VAEs can outperform LSTM language models. We demonstrate perplexity gains on two datasets, representing the first positive language modeling result with VAE. Further, we conduct an in-depth investigation of the use of VAE (with our new decoding architecture) for semi-supervised and unsupervised labeling tasks, demonstrating gains over several strong baselines.
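The abstract's key knob is the dilation schedule of the causal CNN decoder, which sets how many previously generated words the decoder can see. Below is a minimal sketch of that idea, assuming PyTorch; the class and function names are hypothetical illustrations, not the authors' implementation.

```python
# Sketch (not the paper's code) of a dilated causal conv layer and the
# receptive-field arithmetic the paper varies, assuming PyTorch is available.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DilatedCausalConv1d(nn.Module):
    """One causal conv layer: output at position t sees only inputs <= t."""
    def __init__(self, channels, kernel_size=3, dilation=1):
        super().__init__()
        self.left_pad = (kernel_size - 1) * dilation  # pad the past side only
        self.conv = nn.Conv1d(channels, channels, kernel_size, dilation=dilation)

    def forward(self, x):                 # x: (batch, channels, time)
        x = F.pad(x, (self.left_pad, 0))  # left padding keeps the conv causal
        return self.conv(x)

def receptive_field(kernel_size, dilations):
    """Number of past tokens visible at the top of a dilated stack."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)

# Shallow vs. deep dilation schedules give small vs. large context windows,
# the trade-off against use of the latent code described in the abstract:
print(receptive_field(3, [1, 2]))        # 7 tokens
print(receptive_field(3, [1, 2, 4, 8]))  # 31 tokens
```

Exponentially increasing dilations grow the context window geometrically with depth, so a few layers suffice for a large window, while truncating the schedule deliberately starves the decoder and forces it to rely on the encoder's latent code.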
Year
2017
Venue
ICML
DocType
Conference
Volume
abs/1702.08139
Citations
34
PageRank
1.09
References
24
Authors
4
Name | Order | Citations | PageRank
Zichao Yang | 1 | 783 | 26.81
Zhiting Hu | 2 | 758 | 32.20
Ruslan Salakhutdinov | 3 | 121907 | 64.15
Taylor Berg-Kirkpatrick | 4 | 554 | 35.93