Abstract | ||
---|---|---|
In this paper, we introduce a generalized value iteration network (GVIN), which is an end-to-end neural network planning module. GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs. We propose three novel differentiable kernels as graph convolution operators and show that the embedding-based kernel achieves the best performance. Furthermore, we present episodic Q-learning, an improvement upon traditional n-step Q-learning that stabilizes training for VIN and GVIN. Lastly, we evaluate GVIN on planning problems in 2D mazes, irregular graphs, and real-world street networks, showing that GVIN generalizes well for both arbitrary graphs and unseen graphs of larger scale and outperforms a naive generalization of VIN (discretizing a spatial graph into a 2D image). |
Year | Venue | DocType |
---|---|---|
2018 | THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | Conference |
Volume | Citations | PageRank |
abs/1706.02416 | 1 | 0.36 |
References | Authors | |
14 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sufeng Niu | 1 | 15 | 3.54 |
Siheng Chen | 2 | 324 | 27.85 |
Hanyu Guo | 3 | 1 | 0.36 |
Colin Targonski | 4 | 1 | 0.36 |
Melissa C. Smith | 5 | 148 | 16.54 |
Jelena Kovacevic | 6 | 802 | 95.87 |