Title
Recurrent Orthogonal Networks and Long-Memory Tasks
Abstract
Although RNNs have been shown to be powerful tools for processing sequential data, finding architectures or optimization strategies that allow them to model very long-term dependencies remains an active area of research. In this work, we carefully analyze two synthetic datasets, originally outlined by Hochreiter & Schmidhuber (1997), that are used to evaluate the ability of RNNs to store information over many time steps. We explicitly construct RNN solutions to these problems and, using these constructions, illuminate both the problems themselves and the way in which RNNs store different types of information in their hidden states. These constructions furthermore explain the success of recent methods that specify unitary initializations of, or constraints on, the transition matrices.
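Since the abstract appeals to unitary/orthogonal initializations of the transition matrix, the following minimal numpy sketch (an illustration, not the authors' code) shows the underlying intuition: an orthogonal W satisfies W^T W = I, so the linear recurrence h_t = W h_{t-1} preserves the norm of the hidden state, and information neither vanishes nor explodes over many steps.

# Sketch: orthogonal transition matrices preserve hidden-state norm.
# The hidden size and step count are illustrative choices, not from the paper.
import numpy as np

rng = np.random.default_rng(0)
hidden_size = 128

# Draw a random orthogonal matrix via QR decomposition of a Gaussian matrix.
W, _ = np.linalg.qr(rng.standard_normal((hidden_size, hidden_size)))

h = rng.standard_normal(hidden_size)
norm_before = np.linalg.norm(h)
for _ in range(1000):  # 1000 linear recurrent steps h <- W h
    h = W @ h
norm_after = np.linalg.norm(h)

print(norm_before, norm_after)  # equal up to floating-point error

The norm-preservation argument covers only the linear part of the recurrence; it is offered as intuition for the abstract's claim, not as a summary of the paper's explicit constructions.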
Year
2016
Venue
ICML
Field
Sequential data, Matrix (mathematics), Computer science, Unitary state, Artificial intelligence, Machine learning, Long memory
DocType
Conference
Citations
21
PageRank
0.94
References
13
Authors
3
Name            Order  Citations  PageRank
Mikael Henaff   1      272        12.83
Arthur Szlam    2      1056       68.60
Yann LeCun      3      260903     771.21