Title |
---|
Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning. |
Year | Venue | DocType |
---|---|---|
2018 | ACL | Conference |
Citations | PageRank | References |
---|---|---|
0 | 0.34 | 0 |
Authors |
---|
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stefan Riezler | 1 | 1066 | 138.72 |
Julia Kreutzer | 2 | 22 | 5.92 |
Joshua Uyheng | 3 | 0 | 1.69 |