Abstract | ||
---|---|---|
We consider the task of opportunistic channel access in a primary system
composed of independent Gilbert-Elliot channels where the secondary (or
opportunistic) user does not dispose of a priori information regarding the
statistical characteristics of the system. It is shown that this problem may be
cast into the framework of model-based learning in a specific class of
Partially Observed Markov Decision Processes (POMDPs) for which we introduce an
algorithm aimed at striking an optimal tradeoff between the exploration (or
estimation) and exploitation requirements. We provide finite horizon regret
bounds for this algorithm as well as a numerical evaluation of its performance
in the single channel model as well as in the case of stochastically identical
channels. |
Year | Venue | Keywords |
---|---|---|
2009 | Clinical Orthopaedics and Related Research | artificial intelligent |
Field | DocType | Volume |
Mathematical optimization,Channel models,Regret,Partially observable Markov decision process,Computer science,A priori and a posteriori,Markov decision process,Communication channel,Artificial intelligence,Finite horizon,Machine learning,Cognitive radio | Journal | abs/0908.0 |
Citations | PageRank | References |
0 | 0.34 | 12 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sarah Filippi | 1 | 96 | 7.66 |
O. Cappe | 2 | 2112 | 207.95 |
Aurélien Garivier | 3 | 12 | 2.89 |