Abstract | ||
---|---|---|
The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper we focus on the subproblem of preference elicitation, given a mechanism. The user, due to a number of cognitive and technical reasons, is assumed to be initially uninformed over their preferences in the wireless domain. The solution we have developed is a closed-loop user-agent system that assists the user in application, task and context dependent service provisioning by adaptively and interactively learning to select the best wireless data service. The agent learns an incrementally revealed user preference model given explicit or implicit feedback on its decisions by the user. We model this closed-loop system as a Markov Decision Process, where the agent actions are rewarded by the user, and show how a reinforcement learning algorithm can be used to learn a model of the userýs preferences on-line in the given allocation mechanism. We evaluate the performance and value of the agent in a series of preliminary empirical user studies. |
Year | DOI | Venue |
---|---|---|
2004 | 10.1109/AAMAS.2004.161 | AAMAS |
Keywords | DocType | ISBN |
closed-loop user-agent system,learning user preferences,wireless access service,user preference model,preliminary empirical user study,take-it-or-leave-it allocation mechanism,allocation mechanism,wireless domain,agent action,wireless services provisioning,closed-loop system,wireless data service,feedback,game theory,artificial intelligence,security,context dependent,internet,secret sharing,computer science,markov decision process,hardware,cost accounting,user agent,reinforcement learning | Conference | 1-58113-864-4 |
Citations | PageRank | References |
10 | 0.97 | 6 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
G. Lee | 1 | 10 | 0.97 |
S. Bauer | 2 | 10 | 0.97 |
P. Faratin | 3 | 54 | 7.25 |
John Wroclawski | 4 | 1965 | 247.15 |