Title
A comparison of the lasso and marginal regression
Abstract
The lasso is an important method for sparse, high-dimensional regression problems, with efficient algorithms available, a long history of practical success, and a large body of theoretical results supporting and explaining its performance. But even with the best available algorithms, finding the lasso solutions remains a computationally challenging task in cases where the number of covariates vastly exceeds the number of data points. Marginal regression, where each dependent variable is regressed separately on each covariate, offers a promising alternative in this case because the estimates can be computed roughly two orders faster than the lasso solutions. The question that remains is how the statistical performance of the method compares to that of the lasso in these cases. In this paper, we study the relative statistical performance of the lasso and marginal regression for sparse, high-dimensional regression problems. We consider the problem of learning which coefficients are non-zero. Our main results are as follows: (i) we compare the conditions under which the lasso and marginal regression guarantee exact recovery in the fixed design, noise free case; (ii) we establish conditions under which marginal regression provides exact recovery with high probability in the fixed design, noise free, random coefficients case; and (iii) we derive rates of convergence for both procedures, where performance is measured by the number of coefficients with incorrect sign, and characterize the regions in the parameter space recovery is and is not possible under this metric. In light of the computational advantages of marginal regression in very high dimensional problems, our theoretical and simulations results suggest that the procedure merits further study.
Year
DOI
Venue
2012
10.5555/2188385.2343712
Journal of Machine Learning Research
Keywords
Field
DocType
marginal regression guarantee,high-dimensional regression problem,lasso solution,noise free case,exact recovery,marginal regression,fixed design,parameter space recovery,relative statistical performance,statistical performance,phase diagram,regularization,lasso
Data point,Convergence (routing),Covariate,Regression,Elastic net regularization,Lasso (statistics),Regularization (mathematics),Variables,Artificial intelligence,Statistics,Machine learning,Mathematics
Journal
Volume
Issue
ISSN
13
1
1532-4435
Citations 
PageRank 
References 
23
1.12
7
Authors
4
Name
Order
Citations
PageRank
Christopher R. Genovese1802156.76
Jiashun Jin21147.75
Larry A. Wasserman375276.94
Zhigang Yao4455.12