Name
Papers
Collaborators
SHAI SHALEV-SHWARTZ
123
62
Citations 
PageRank 
Referers 
3681
276.32
5730
Referees 
References 
991
1285
Search Limit
1001000
Title
Citations
PageRank
Year
Efficient Learning of CNNs using Patch Based Features.00.342022
Computational Separation Between Convolutional and Fully-Connected Networks00.342021
The Connection Between Approximation, Depth Separation and Learnability in Neural Networks.00.342021
The Implicit Bias of Depth: How Incremental Learning Drives Generalization00.342020
Proving the Lottery Ticket Hypothesis: Pruning is All You Need00.342020
The Implications of Local Correlation on Learning Some Deep Functions00.342020
Decoupling Gating from Linearity.00.342019
Vision Zero: on a Provable Method for Eliminating Roadway Accidents without Compromising Traffic Throughput.00.342019
Is Deeper Better only when Shallow is Good?10.362019
A Provably Correct Algorithm for Deep Learning that Actually Works.40.392018
Decoupling "when to update" from "how to update".160.612017
Weight Sharing is Crucial to Succesful Optimization.30.422017
SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data.310.862017
Average Stability is Invariant to Data Preconditioning. Implications to Exp-concave Empirical Risk Minimization.20.412017
Failures of Gradient-Based Deep Learning.250.992017
On a Formal Model of Safe and Scalable Self-driving Cars.211.382017
Effective Semisupervised Learning on Manifolds.00.342017
Tightening the Sample Complexity of Empirical Risk Minimization via Preconditioned Stability.00.342016
Minimizing the Maximal Loss: How and Why?70.542016
SDCA without Duality, Regularization, and Individual Convexity.200.932016
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving.361.672016
On Lower and Upper Bounds in Smooth and Strongly Convex Optimization.00.342016
Long-term Planning by Short-term Prediction.70.552016
Faster Low-rank Approximation using Adaptive Gap-based Preconditioning.00.342016
Subspace Learning with Partial Information.00.342016
Solving Ridge Regression using Sketched Preconditioned SVRG.70.452016
Learning a Metric Embedding for Face Recognition using the Multibatch Method.90.502016
Distribution Free Learning with Local Queries.00.342016
On the Sample Complexity of End-to-end Training vs. Semantic Abstraction Training.110.592016
Faster SGD Using Sketched Conditioning20.392015
On Lower and Upper Bounds for Smooth and Strongly Convex Optimization Problems.00.342015
SDCA without Duality.221.082015
Accelerated proximal stochastic dual coordinate ascent for regularized loss minimization.1244.862014
SelfieBoost: A Boosting Algorithm for Deep Learning.30.422014
The Complexity of Learning Halfspaces using Generalized Linear Methods.30.392014
On the Computational Efficiency of Training Neural Networks.362.112014
Complexity theoretic limitations on learning DNF's.240.852014
The Sample Complexity of Subspace Learning with Partial Information.20.412014
Efficient active learning of halfspaces: an aggressive approach30.392013
More data speeds up training time in learning halfspaces over sparse vectors.160.612013
Stochastic dual coordinate ascent methods for regularized loss231.212013
A Provably Efficient Algorithm for Training Deep Networks40.702013
Learning Optimally Sparse Support Vector Machines.70.502013
Accelerated Mini-Batch Stochastic Dual Coordinate Ascent522.632013
The error rate of learning halfspaces using Kernel-SVMs00.342012
Using More Data to Speed-up Training Time120.812012
Near-Optimal Algorithms for Online Matrix Prediction00.342012
Proximal Stochastic Dual Coordinate Ascent412.522012
Learning Halfspaces with the Zero-One Loss: Time-Accuracy Tradeoffs.80.502012
Learning Sparse Low-Threshold Linear Classifiers00.342012
  • 1
  • 2