Abstract | ||
---|---|---|
Nowadays, many e-commerce companies are using reinforcement learning recommendation methods to maximize long-term benefits. Alibaba Group and Nanjing University build “Virtual Taobao”, a Taobao simulator. In this paper, we proposed TTD3 based on TD3 and trained it in Virtual Taobao. There are three important improvements in TTD3's training process. First, the current actor-network and target actor... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/ISCC53001.2021.9631429 | 2021 IEEE Symposium on Computers and Communications (ISCC) |
Keywords | DocType | ISBN |
Training,Computers,Computational modeling,Reinforcement learning,Companies,Electronic commerce | Conference | 978-1-6654-2744-9 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Huang Lan | 1 | 10 | 13.31 |
Xiaofang Zhang | 2 | 0 | 0.34 |
Yan Wang | 3 | 0 | 0.34 |
Xuping Xie | 4 | 0 | 0.34 |