Title
Altitude Training: Strong Bounds for Single-Layer Dropout.
Abstract
Dropout training, originally designed for deep neural networks, has been successful on high-dimensional single-layer natural language tasks. This paper proposes a theoretical explanation for this phenomenon: we show that, under a generative Poisson topic model with long documents, dropout training improves the exponent in the generalization bound for empirical risk minimization. Dropout achieves this gain much like a marathon runner who practices at altitude: once a classifier learns to perform reasonably well on training examples that have been artificially corrupted by dropout, it will do very well on the uncorrupted test set. We also show that, under similar conditions, dropout preserves the Bayes decision boundary and should therefore induce minimal bias in high dimensions.
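For readers unfamiliar with the setup, the sketch below is an illustrative assumption, not the paper's code: it shows what "training examples artificially corrupted by dropout" means for a single-layer (logistic-regression) classifier on bag-of-words features. Each feature is zeroed independently with probability delta during training, while test-time inputs are left uncorrupted; the function names and the plain gradient-descent loop are placeholders chosen for clarity.

# Illustrative sketch only (assumed setup): single-layer dropout training
import numpy as np

def dropout_corrupt(X, delta, rng):
    # Zero each feature of each example independently with probability delta.
    return X * (rng.random(X.shape) >= delta)

def train_dropout_logreg(X, y, delta=0.5, lr=0.1, epochs=100, seed=0):
    # Gradient descent on the logistic loss, drawing a fresh dropout mask
    # for the training examples at every pass.
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        Xc = dropout_corrupt(X, delta, rng)   # artificially corrupted training set
        p = 1.0 / (1.0 + np.exp(-Xc @ w))     # predicted P(y = 1)
        w -= lr * Xc.T @ (p - y) / n          # gradient step on corrupted data
    return w

def predict(w, X, delta=0.5):
    # Test-time inputs are uncorrupted; under the usual dropout convention the
    # learned weights are rescaled by the feature retention rate 1 - delta.
    return (X @ ((1.0 - delta) * w) > 0).astype(int)

The "altitude training" analogy in the abstract corresponds to the gap between the two stages above: the classifier is fit on the corrupted copies produced in the training loop but evaluated on the uncorrupted inputs passed to predict.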
Year
2014
Venue
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014)
DocType
Journal
Volume
27
ISSN
1049-5258
Citations
11
PageRank
0.62
References
19
Authors
4
Name               Order   Citations   PageRank
Stefan Wager         1        156        16.00
William Fithian      2         13         1.68
Sida Wang            3        541        44.65
Percy Liang          4       3416       172.27