Depthwise Separable Convolutions for Neural Machine Translation. - Citegraph

Paper Info

Title
Depthwise Separable Convolutions for Neural Machine Translation.

Abstract
Depthwise separable convolutions reduce the number of parameters and computation used in convolutional operations while increasing representational efficiency. They have been shown to be successful in image classification models, both in obtaining better models than previously possible for a given parameter count (the Xception architecture) and considerably reducing the number of parameters required to perform at a given level (the MobileNets family of architectures). Recently, convolutional sequence-to-sequence networks have been applied to machine translation tasks with good results. In this work, we study how depthwise separable convolutions can be applied to neural machine translation. We introduce a new architecture inspired by Xception and ByteNet, called SliceNet, which enables a significant reduction of the parameter count and amount of computation needed to obtain results like ByteNet, and, with a similar parameter count, achieves better results. In addition to showing that depthwise separable convolutions perform well for machine translation, we investigate the architectural changes that they enable: we observe that thanks to depthwise separability, we can increase the length of convolution windows, removing the need for filter dilation. We also introduce a new super-separable convolution operation that further reduces the number of parameters and computational cost of the models.

Year	Venue	Field
2017	ICLR	Dilation (morphology),Computer science,Convolution,Machine translation,Algorithm,Separable space,Theoretical computer science,Artificial intelligence,Contextual image classification,Machine learning,Computation
DocType	Volume	Citations
Journal	abs/1706.03059	16
PageRank	References	Authors
0.74	13	3

Authors (3 rows)

Cited by (16 rows)

References (13 rows)

Name	Order	Citations	PageRank
Łukasz Kaiser	1	2307	89.08
Aidan N. Gomez	2	534	13.23
François Chollet	3	51	3.85

1