Title
Generalisation error in learning with random features and the hidden manifold model*
Abstract
We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden manifold model. We consider the high-dimensional regime and using the replica method from statistical physics, we provide a closed-form expression for the asymptotic generalisation performance in these problems, valid in both the under- and over-parametrised regimes and for a broad choice of generalised linear model loss functions. In particular, we show how to obtain analytically the so-called double descent behaviour for logistic regression with a peak at the interpolation threshold, we illustrate the superiority of orthogonal against random Gaussian projections in learning with random features, and discuss the role played by correlations in the data generated by the hidden manifold model. Beyond the interest in these particular problems, the theoretical formalism introduced in this manuscript provides a path to further extensions to more complex tasks.
Year
DOI
Venue
2020
10.1088/1742-5468/ac3ae6
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT
Keywords
DocType
Volume
cavity and replica method, deep learning, learning theory, machine learning
Conference
2021
Issue
ISSN
Citations 
12
1742-5468
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Federica Gerace101.69
Bruno Loureiro201.69
Florent Krzakala397767.30
Marc Mézard459039.09
Lenka Zdeborová5119078.62