Dynamic Learning of Sequential Choice Bandit Problem under Marketing Fatigue. - Citegraph

Paper Info

Title
Dynamic Learning of Sequential Choice Bandit Problem under Marketing Fatigue.

Abstract
Motivated by the observation that overexposure to unwanted marketing activities leads to customer dissatisfaction, we consider a setting where a platform offers a sequence of messages to its users and is penalized when users abandon the platform due to marketing fatigue. We propose a novel sequential choice model to capture multiple interactions taking place between the platform and its user: Upon receiving a message, a user decides on one of the three actions: accept the message, skip and receive the next message, or abandon the platform. Based on user feedback, the platform dynamically learns users' abandonment distribution and their valuations of messages to determine the length of the sequence and the order of the messages, while maximizing the cumulative payoff over a horizon of length T. We refer to this online learning task as the sequential choice bandit problem. For the offline combinatorial optimization problem, we show a polynomial-time algorithm. For the online problem, we propose an algorithm that balances exploration and exploitation, and characterize its regret bound. Lastly, we demonstrate how to extend the model with user contexts to incorporate personalization.
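The three-way choice described in the abstract (accept, skip, or abandon) can be illustrated with a small expected-payoff calculation. This is a minimal sketch under stated assumptions, not the paper's formulation: the names `Message`, `expected_payoff`, `accept_prob`, `abandon_prob`, and `abandon_penalty` are hypothetical, a session is assumed to end once a message is accepted, and the abandonment probability is assumed to be the same constant at every step.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Message:
    accept_prob: float  # assumed chance the user accepts this message when shown
    reward: float       # assumed payoff to the platform if the message is accepted


def expected_payoff(messages: Sequence[Message],
                    abandon_prob: float,
                    abandon_penalty: float) -> float:
    """Expected payoff of showing `messages` in the given order.

    Assumed dynamics per message: the user accepts (the session ends and the
    platform earns the reward); otherwise abandons with probability
    `abandon_prob` (the platform pays `abandon_penalty`); otherwise skips
    to the next message.
    """
    total = 0.0
    reach = 1.0  # probability the user is still viewing messages
    for m in messages:
        total += reach * m.accept_prob * m.reward
        not_accepted = reach * (1.0 - m.accept_prob)
        total -= not_accepted * abandon_prob * abandon_penalty
        reach = not_accepted * (1.0 - abandon_prob)
    return total
```

Under these assumptions, evaluating `expected_payoff` on different orderings or truncations of the same message list shows how both the order and the length of the sequence affect the payoff, which is the offline decision the abstract refers to.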

Year	Venue	Field
2019	THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE	Online learning, Dynamic learning, Combinatorial optimization problem, Exposure, Regret, Computer science, Valuation (finance), Marketing, Stochastic game, Personalization
DocType	Volume	Citations
Journal	abs/1903.08193	1
PageRank	References	Authors
0.37	0	2

Authors (2 rows)

Name	Order	Citations	PageRank
Junyu Cao	1	2	1.74
SUN Wei	2	247	26.63
