Title
Mirror Descent Meets Fixed Share (and feels no regret).
Abstract
Mirror descent with an entropic regularizer is known to achieve shifting regret bounds that are logarithmic in the dimension. This is done using either a carefully designed projection or a weight-sharing technique. Via a novel unified analysis, we show that these two approaches deliver essentially equivalent bounds on a notion of regret generalizing shifting, adaptive, discounted, and other related regrets. Our analysis also captures and extends the generalized weight-sharing technique of Bousquet and Warmuth, and can be refined in several ways, including improvements for small losses and adaptive tuning of parameters.
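For context, the fixed-share forecaster referred to in the title and abstract combines an exponentially weighted (entropic mirror descent) update with a uniform weight-sharing step. Below is a minimal sketch of that classical update in the style of Herbster and Warmuth; the function name, the parameter values eta and alpha, and the loss interface are illustrative assumptions, not code from the paper.

import numpy as np

def fixed_share_update(weights, losses, eta=0.5, alpha=0.01):
    """One round of a fixed-share forecaster over N experts (illustrative sketch).

    weights : current probability vector over experts (sums to 1)
    losses  : vector of expert losses observed this round
    eta     : learning rate (illustrative value, not from the paper)
    alpha   : sharing rate, fraction of weight redistributed uniformly
    """
    # Exponentially weighted step (entropic mirror descent / Hedge).
    v = weights * np.exp(-eta * losses)
    v /= v.sum()
    # Fixed-share step: mix in the uniform distribution, which is what
    # keeps all experts "alive" and yields shifting (tracking) regret bounds.
    n = len(weights)
    return alpha / n + (1.0 - alpha) * v

# Usage: run one round with three experts and arbitrary losses.
weights = np.full(3, 1.0 / 3.0)
losses = np.array([0.2, 1.0, 0.5])
weights = fixed_share_update(weights, losses)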
Year
2012
Venue
Neural Information Processing Systems
Field
Mathematical economics, Mathematical optimization, Regret, Generalization, Logarithm, Mathematics
Volume
25
Conference
NIPS 2012, Lake Tahoe, United States (2012)
Citations
17
PageRank
0.83
References
7
Authors
4
Name                 Order  Citations  PageRank
Nicolò Cesa-Bianchi  1      4609       590.83
Pierre Gaillard      2      79         10.89
Gábor Lugosi         3      1092       195.02
Gilles Stoltz        4      351        31.53