DRL4IR: 3rd Workshop on Deep Reinforcement Learning for Information Retrieval | 0 | 0.34 | 2022 |
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality | 0 | 0.34 | 2022 |
Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation | 0 | 0.34 | 2022 |
Independence-aware Advantage Estimation. | 0 | 0.34 | 2021 |
Return-Based Contrastive Representation Learning for Reinforcement Learning | 0 | 0.34 | 2021 |
DRL4IR: 2nd Workshop on Deep Reinforcement Learning for Information Retrieval | 0 | 0.34 | 2021 |
Demonstration actor critic | 0 | 0.34 | 2021 |
Leveraging Demonstrations for Reinforcement Recommendation Reasoning over Knowledge Graphs | 5 | 0.42 | 2020 |
RD$2$ - Reward Decomposition with Representation Decomposition. | 0 | 0.34 | 2020 |
Distributional Reward Decomposition for Reinforcement Learning | 0 | 0.34 | 2019 |
Fully Parameterized Quantile Function for Distributional Reinforcement Learning | 0 | 0.34 | 2019 |
Semi-Supervised Neural Machine Translation via Marginal Distribution Estimation. | 0 | 0.34 | 2019 |
Unified Policy Optimization for Robust Reinforcement Learning. | 0 | 0.34 | 2019 |
Trust Region Evolution Strategies | 1 | 0.36 | 2019 |
Investment Behaviors Can Tell What Inside: Exploring Stock Intrinsic Properties for Stock Trend Prediction | 4 | 0.54 | 2019 |
Individualized Indicator for All: Stock-wise Technical Indicator Optimization with Stock Embedding | 1 | 0.48 | 2019 |
Learning Structured Representation for Text Classification via Reinforcement Learning. | 7 | 0.42 | 2018 |
Word Attention for Sequence to Sequence Text Understanding. | 2 | 0.34 | 2018 |
Efficient Sequence Learning with Group Recurrent Networks. | 0 | 0.34 | 2018 |
Dual Transfer Learning for Neural Machine Translation with Marginal Distribution Regularization. | 7 | 0.50 | 2018 |
Reinforcement Learning for Relation Classification From Noisy Data. | 10 | 0.46 | 2018 |
Sequence Prediction with Unlabeled Data by Reward Function Learning. | 4 | 0.41 | 2017 |
Attention-based LSTM for Aspect-level Sentiment Classification. | 127 | 3.30 | 2016 |
Semi-Supervised Multinomial Naive Bayes for Text Classification by Leveraging Word-Level Statistical Constraint. | 3 | 0.38 | 2016 |
Sentiment Extraction by Leveraging Aspect-Opinion Association Structure | 1 | 0.35 | 2015 |
Clustering Aspect-related Phrases by Leveraging Sentiment Distribution Consistency. | 4 | 0.42 | 2014 |