Policy Evaluation Using the Ω-Return - Citegraph

Paper Info

Title
Policy Evaluation Using the Ω-Return

Abstract
We propose the Ω-return as an alternative to the λ-return currently used by the TD(λ) family of algorithms. The benefit of the Ω-return is that it accounts for the correlation of different length returns. Because it is difficult to compute exactly, we suggest one way of approximating the Ω-return. We provide empirical studies that suggest that it is superior to the λ-return and γ-return for a variety of problems.

Year	Venue	Field
2015	Annual Conference on Neural Information Processing Systems	Mathematical optimization,Computer science,Correlation,Empirical research
DocType	Citations	PageRank
Conference	1	0.39
References	Authors
10	4

Authors (4 rows)

Cited by (1 rows)

References (10 rows)

Name	Order	Citations	PageRank
Philip S. Thomas	1	184	22.55
S. Niekum	2	165	23.73
Georgios Theocharous	3	140	16.65
George Konidaris	4	801	59.30

1