Token-Level and Sequence-Level Loss Smoothing for RNN Language Models - Citegraph

Paper Info

Title
Token-Level and Sequence-Level Loss Smoothing for RNN Language Models

Abstract
Despite the effectiveness of recurrent neural network language models, their maximum likelihood estimation suffers from two limitations. First, it treats all sentences that do not match the ground truth as equally poor, ignoring the structure of the output space. Second, it suffers from "exposure bias": during training, tokens are predicted given ground-truth sequences, while at test time prediction is conditioned on generated output sequences. To overcome these limitations, we build upon the recent reward augmented maximum likelihood approach, i.e., sequence-level smoothing, which encourages the model to predict sentences close to the ground truth according to a given performance metric. We extend this approach to token-level loss smoothing, and propose improvements to the sequence-level smoothing approach. Our experiments on two different tasks, image captioning and machine translation, show that token-level and sequence-level loss smoothing are complementary, and significantly improve results.

Year	DOI	Venue
2018	10.18653/v1/p18-1195	PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1
Field	DocType	Volume
Closed captioning,Computer science,Performance metric,Machine translation,Maximum likelihood,Smoothing,Ground truth,Artificial intelligence,Security token,Language model,Machine learning	Journal	abs/1805.05062
Citations	PageRank	References
2	0.36	0
Authors
3

Authors (3 rows)

Name	Order	Citations	PageRank
Maha Elbayad	1	2	2.73
Laurent Besacier	2	696	102.67
J. J. Verbeek	3	3944	181.44
