Optimal Errors And Phase Transitions In High-Dimensional Generalized Linear Models - Citegraph

Paper Info

Title
Optimal Errors And Phase Transitions In High-Dimensional Generalized Linear Models

Abstract
Generalized linear models (GLMs) are used in high-dimensional machine learning, statistics, communications, and signal processing. In this paper we analyze GLMs when the data matrix is random, as relevant in problems such as compressed sensing, error-correcting codes, or benchmark models in neural networks. We evaluate the mutual information (or "free entropy") from which we deduce the Bayes-optimal estimation and generalization errors. Our analysis applies to the high-dimensional limit where both the number of samples and the dimension are large and their ratio is fixed. Nonrigorous predictions for the optimal errors existed for special cases of GLMs, e.g., for the perceptron, in the field of statistical physics based on the so-called replica method. Our present paper rigorously establishes those decades-old conjectures and brings forward their algorithmic interpretation in terms of performance of the generalized approximate message-passing algorithm. Furthermore, we tightly characterize, for many learning problems, regions of parameters for which this algorithm achieves the optimal performance and locate the associated sharp phase transitions separating learnable and nonlearnable regions. We believe that this random version of GLMs can serve as a challenging benchmark for multipurpose algorithms.

Year	DOI	Venue
2018	10.1073/pnas.1802705116	PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
Keywords	DocType	Volume
high-dimensional inference, generalized linear model, Bayesian inference, perceptron, approximate message-passing algorithm	Conference	116
Issue	ISSN	Citations
12	0027-8424	6
PageRank	References	Authors
0.48	0	5

Authors (5 rows)

Cited by (6 rows)

References (0 rows)

Name	Order	Citations	PageRank
Jean Barbier	1	117	13.46
Florent Krzakala	2	977	67.30
Nicolas Macris	3	268	26.29
Léo Miolane	4	6	0.48
Lenka Zdeborová	5	1190	78.62

1