Target Tracking for Contextual Bandits: Application to Demand Side Management. | 0 | 0.34 | 2019 |
Uniform regret bounds over R^d for the sequential linear regression problem with the square loss. | 0 | 0.34 | 2018 |
KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints. | 0 | 0.34 | 2018 |
Sequential model aggregation for production forecasting. | 0 | 0.34 | 2018 |
Fano's Inequality for Random Variables | 0 | 0.34 | 2017 |
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems | 16 | 0.94 | 2016 |
Set-valued approachability and online learning with partial monitoring | 4 | 0.53 | 2014 |
A Second-order Bound with Excess Losses. | 13 | 0.75 | 2014 |
Approachability in unknown games: Online learning meets multi-objective optimization. | 2 | 0.39 | 2014 |
A Primal Condition For Approachability With Partial Monitoring | 0 | 0.34 | 2013 |
Forecasting electricity consumption by aggregating specialized experts - A review of the sequential aggregation of specialized experts, with an application to Slovakian and French country-wide one-day-ahead (half-)hourly predictions. | 2 | 0.51 | 2013 |
A new look at shifting regret | 13 | 0.77 | 2012 |
Mirror Descent Meets Fixed Share (and feels no regret). | 17 | 0.83 | 2012 |
Forecasting electricity consumption by aggregating specialized experts | 11 | 0.95 | 2012 |
X-Armed Bandits | 14 | 1.07 | 2011 |
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences. | 22 | 3.81 | 2011 |
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences | 0 | 0.34 | 2011 |
Robust approachability and regret minimization in games with partial monitoring | 0 | 0.34 | 2011 |
Lipschitz bandits without the Lipschitz constant | 20 | 1.21 | 2011 |
Pure exploration in finitely-armed and continuous-armed bandits | 13 | 0.84 | 2011 |
Robust approachability and regret minimization in games with partial monitoring | 6 | 0.52 | 2011 |
A Geometric Proof of Calibration | 10 | 0.75 | 2010 |
Online Multi-task Learning with Hard Constraints. | 4 | 0.47 | 2009 |
Pure exploration in multi-armed bandits problems | 6 | 0.95 | 2008 |
Online Optimization in X-Armed Bandits | 43 | 3.51 | 2008 |
Strategies for Prediction Under Imperfect Monitoring | 10 | 0.98 | 2008 |
Learning correlated equilibria in games with compact sets of strategies | 37 | 2.36 | 2007 |
Improved second-order bounds for prediction with expert advice | 45 | 2.60 | 2007 |
Regret Minimization Under Partial Monitoring | 23 | 2.48 | 2006 |
Internal Regret in On-Line Portfolio Selection | 20 | 1.63 | 2005 |