Title
Exploration-Exploitation Tradeoffs for Experts Algorithms in Reactive Environments
Abstract
A reactive environment is one that responds to the actions of an agent rather than evolving obliviously. In reactive environments, experts algorithms must balance exploration and exploitation of experts more carefully than in oblivious ones. In addition, a more subtle denition of a learnable value of an expert is required. A general exploration-exploitation experts method is presented along with a proper denition of value. The method is shown to asymptotically perform as well as the best available expert. Several variants are analyzed from the viewpoint of the exploration-exploitation tradeoff, including explore-then-exploit, polynomially vanishing exploration, constant-frequency exploration, and constant-size explo- ration phases. Complexity and performance bounds are proven.
Year
Venue
Field
2004
Neural Information Processing Systems
Computer science,Algorithm,Artificial intelligence,Machine learning
DocType
Citations 
PageRank 
Conference
11
0.97
References 
Authors
3
2
Name
Order
Citations
PageRank
Daniela Pucci de Farias117617.09
Nimrod Megiddo24244668.46