Abstract |
---|
We propose the Ω-return as an alternative to the λ-return currently used by the TD(λ) family of algorithms. The benefit of the Ω-return is that it accounts for the correlation of different length returns. Because it is difficult to compute exactly, we suggest one way of approximating the Ω-return. We provide empirical studies that suggest that it is superior to the λ-return and γ-return for a variety of problems. |

Year | Venue | Field |
---|---|---|
2015 | Annual Conference on Neural Information Processing Systems | Mathematical optimization, Computer science, Correlation, Empirical research |

DocType | Citations | PageRank |
---|---|---|
Conference | 1 | 0.39 |

References | Authors |
---|---|
10 | 4 |

Name | Order | Citations | PageRank |
---|---|---|---|
Philip S. Thomas | 1 | 184 | 22.55 |
S. Niekum | 2 | 165 | 23.73 |
Georgios Theocharous | 3 | 140 | 16.65 |
George Konidaris | 4 | 801 | 59.30 |