Graph Neural Networks with Learnable Structural and Positional Representations | 0 | 0.34 | 2022 |
Building Robust Ensembles via Margin Boosting. | 0 | 0.34 | 2022 |
Recurrent Independent Mechanisms | 0 | 0.34 | 2021 |
Learning Neural Generative Dynamics for Molecular Conformation Generation | 0 | 0.34 | 2021 |
Deep Verifier Networks: Verification Of Deep Discriminative Models With Deep Generative Models | 0 | 0.34 | 2021 |
Multi-task self-supervised learning for Robust Speech Recognition | 1 | 0.37 | 2020 |
A learning-based algorithm to quickly compute good primal solutions for Stochastic Integer Programs | 0 | 0.34 | 2020 |
A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms | 1 | 0.35 | 2020 |
Speech Model Pre-training for End-to-End Spoken Language Understanding. | 2 | 0.36 | 2019 |
Updates of Equilibrium Prop Match Gradients of Backprop Through Time in an RNN with Static Input. | 0 | 0.34 | 2019 |
GradMask: Reduce Overfitting by Regularizing Saliency. | 0 | 0.34 | 2019 |
Learning Speaker Representations with Mutual Information | 3 | 0.40 | 2019 |
Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks. | 1 | 0.34 | 2019 |
Unsupervised State Representation Learning in Atari. | 0 | 0.34 | 2019 |
Maximum Entropy Generators for Energy-Based Models. | 1 | 0.35 | 2019 |
Learning Anonymized Representations with Adversarial Neural Networks. | 3 | 0.37 | 2018 |
A3T: Adversarially Augmented Adversarial Training. | 3 | 0.39 | 2018 |
Neural Models for Key Phrase Extraction and Question Generation. | 5 | 0.41 | 2018 |
Disentangling the independently controllable factors of variation by interacting with the world. | 2 | 0.36 | 2018 |
Fraternal Dropout. | 0 | 0.34 | 2018 |
DEFactor: Differentiable Edge Factorization-based Probabilistic Graph Generation. | 2 | 0.37 | 2018 |
Dynamic Layer Normalization For Adaptive Neural Acoustic Modeling In Speech Recognition | 4 | 0.48 | 2017 |
Residual Connections Encourage Iterative Inference. | 10 | 0.61 | 2017 |
ACtuAL: Actor-Critic Under Adversarial Learning. | 1 | 0.34 | 2017 |
Measuring the tendency of CNNs to Learn Surface Statistical Regularities. | 19 | 0.66 | 2017 |
Variational Walkback: Learning a Transition Operator as a Stochastic Recurrent Net | 2 | 0.37 | 2017 |
Deep Learning for Patient-Specific Kidney Graft Survival Analysis. | 10 | 0.69 | 2017 |
Towards more hardware-friendly deep learning. | 0 | 0.34 | 2017 |
STDP-Compatible Approximation of Backpropagation in an Energy-Based Model. | 9 | 0.48 | 2017 |
Hierarchical Multiscale Recurrent Neural Networks. | 67 | 2.27 | 2017 |
Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models | 245 | 6.56 | 2016 |
Iterative Alternating Neural Attention for Machine Reading. | 23 | 1.05 | 2016 |
BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1. | 96 | 4.62 | 2016 |
Oracle Performance for Visual Captioning. | 1 | 0.38 | 2016 |
Generating Factoid Questions With Recurrent Neural Networks: The 30m Factoid Question-Answer Corpus | 34 | 1.06 | 2016 |
ReSeg: A Recurrent Neural Network for Object Segmentation | 8 | 0.57 | 2015 |
Neural Networks with Few Multiplications | 0 | 0.34 | 2015 |
On Using Very Large Target Vocabulary For Neural Machine Translation | 223 | 10.44 | 2015 |
Variance Reduction in SGD by Distributed Importance Sampling | 13 | 0.66 | 2015 |
IAPR keynote lecture IV: Deep learning | 0 | 0.34 | 2015 |
Early Inference in Energy-Based Models Approximates Back-Propagation | 0 | 0.34 | 2015 |
NICE: Non-linear Independent Components Estimation. | 0 | 0.34 | 2015 |
Blocks and Fuel: Frameworks for deep learning | 48 | 2.74 | 2015 |
A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion | 83 | 2.27 | 2015 |
Equilibrated adaptive learning rates for non-convex optimization | 52 | 2.49 | 2015 |
Towards Biologically Plausible Deep Learning. | 33 | 1.68 | 2015 |
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. | 1305 | 52.71 | 2014 |
Low precision arithmetic for deep learning. | 35 | 2.18 | 2014 |
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. | 815 | 25.09 | 2014 |
The Spike-and-Slab RBM and Extensions to Discrete and Sparse Data Distributions. | 10 | 0.50 | 2014 |