Title
End-to-End Deep Reinforcement Learning based Recommendation with Supervised Embedding.
Abstract
The research of reinforcement learning (RL) based recommendation method has become a hot topic in recommendation community, due to the recent advance in interactive recommender systems. The existing RL recommendation approaches can be summarized into a unified framework with three components, namely embedding component (EC), state representation component (SRC) and policy component (PC). We find that EC cannot be nicely trained with the other two components simultaneously. Previous studies bypass the obstacle through a pre-training and fixing strategy, which makes their approaches unlike a real end-to-end fashion. More importantly, such pre-trained and fixed EC suffers from two inherent drawbacks: (1) Pre-trained and fixed embeddings are unable to model evolving preference of users and item correlations in the dynamic environment; (2) Pre-training is inconvenient in the industrial applications. To address the problem, in this paper, we propose an End-to-end Deep Reinforcement learning based Recommendation framework (EDRR). In this framework, a supervised learning signal is carefully designed for smoothing the update gradients to EC, and three incorporating ways are introduced and compared. To the best of our knowledge, we are the first to address the training compatibility between the three components in RL based recommendations. Extensive experiments are conducted on three real-world datasets, and the results demonstrate the proposed EDRR effectively achieves the end-to-end training purpose for both policy-based and value-based RL models, and delivers better performance than state-of-the-art methods.
Year
DOI
Venue
2020
10.1145/3336191.3371858
WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining Houston TX USA February, 2020
Keywords
Field
DocType
Recommendation, Reinforcement Learning, End-to-End, Supervised Embedding
Embedding,Information retrieval,End-to-end principle,Computer science,Reinforcement learning
Conference
ISBN
Citations 
PageRank 
978-1-4503-6822-3
2
0.37
References 
Authors
27
6
Name
Order
Citations
PageRank
feng liu118039.13
Guo Huifeng213415.44
Li Xutao336636.06
Ruiming Tang4397.21
Yunming Ye513715.58
Xiuqiang He631239.21