Title
SEARNN: Training RNNs with Global-Local Losses.
Abstract
We propose SEARNN, a novel training algorithm for recurrent neural networks (RNNs) inspired by the learning to search (L2S) approach to structured prediction. RNNs have been widely successful in structured prediction applications such as machine translation or parsing, and are commonly trained using maximum likelihood estimation (MLE). Unfortunately, this training loss is not always an appropriate surrogate for the test error: by only maximizing the ground truth probability, it fails to exploit the wealth of information offered by structured losses. Further, it introduces discrepancies between training and predicting (such as exposure bias) that may hurt test performance. Instead, SEARNN leverages test-alike search space exploration to introduce global-local losses that are closer to the test error. We first demonstrate improved performance over MLE on two different tasks: OCR and spelling correction. Then, we propose a subsampling strategy to enable SEARNN to scale to large vocabulary sizes. This allows us to validate the benefits of our approach on a machine translation task.
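To make the abstract's idea concrete, below is a minimal sketch of a SEARNN-style training step. It is not the authors' reference implementation: the decoder interface (`model.encode`, `model.step`, `model.rollout`, `model.bos_id`), the task cost `cost_fn`, and the choice of a log-loss against the lowest-cost candidate are all assumptions made for illustration. At each cell, the model rolls in to a state, a subsample of candidate tokens (the scaling strategy mentioned above) is rolled out to completion and scored with the structured cost, and a cost-sensitive local loss is applied.

```python
# Illustrative sketch only: interface names and the log-loss variant are
# assumptions, not the paper's reference code.
import torch
import torch.nn.functional as F

def searnn_step(model, src, tgt, vocab_size, cost_fn, n_samples=25):
    """One SEARNN-style update on a single (src, tgt) pair."""
    hidden = model.encode(src)                    # encoder pass (assumed API)
    tok = tgt.new_full((1,), model.bos_id)        # start-of-sequence token
    total_loss = 0.0
    for t in range(len(tgt)):
        logits, hidden = model.step(tok, hidden)  # roll-in with the learned policy
        # Subsample candidate tokens so the method scales to large
        # vocabularies: keep the ground-truth token, fill the rest at random.
        cand = torch.randint(0, vocab_size, (n_samples,))
        cand[0] = tgt[t]
        # Roll out each candidate to the end of the sequence and score the
        # completed prediction with the structured task cost (e.g. Hamming).
        costs = torch.tensor([
            cost_fn(model.rollout(c, hidden, steps=len(tgt) - t - 1), tgt[t:])
            for c in cand
        ])
        # Global-local loss: a local cost-sensitive log-loss whose target is
        # the candidate with the smallest global (sequence-level) cost.
        total_loss = total_loss + F.cross_entropy(
            logits[:, cand], costs.argmin().unsqueeze(0))
        tok = logits.argmax(dim=-1).detach()      # continue the roll-in
    total_loss.backward()                         # gradients for an outer optimizer
    return float(total_loss)
```

Unlike teacher-forced MLE, each local target here is informed by the full sequence-level cost of alternative continuations, which is what the paper's "global-local" terminology refers to.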
Year
2017
Venue
International Conference on Learning Representations
Field
Structured prediction, Machine translation, Recurrent neural network, Exploit, Ground truth, Spelling, Artificial intelligence, Parsing, Vocabulary, Machine learning, Mathematics
DocType
Journal
Volume
abs/1706.04499
Citations
6
PageRank
0.40
References
24
Authors
4
Name                     Order   Citations   PageRank
Rémi Leblond             1       6           0.74
Jean-Baptiste Alayrac    2       85          7.47
A. Osokin                3       430         19.01
Simon Lacoste-Julien     4       1138        62.72