A Mathematical Framework for Feature Selection from Real-World Data with Non-Linear Observations. - Citegraph

Paper Info

Title
A Mathematical Framework for Feature Selection from Real-World Data with Non-Linear Observations.

Abstract
In this paper, we study the challenge of feature selection based on a relatively small collection of sample pairs ${(x_i, y_i)}_{1 leq i leq m}$. The observations $y_i mathbb{R}$ are thereby supposed to follow a noisy single-index model, depending on a certain set of signal variables. A major difficulty is that these variables usually cannot be observed directly, but rather arise as hidden factors in the actual data vectors $x_i mathbb{R}^d$ (feature variables). We will prove that a successful variable selection is still possible in this setup, even when the applied estimator does not have any knowledge of the underlying model parameters and only takes the u0027rawu0027 samples ${(x_i, y_i)}_{1 leq i leq m}$ as input. The model assumptions of our results will be fairly general, allowing for non-linear observations, arbitrary convex signal structures as well as strictly convex loss functions. This is particularly appealing for practical purposes, since in many applications, already standard methods, e.g., the Lasso or logistic regression, yield surprisingly good outcomes. Apart from a general discussion of the practical scope of our theoretical findings, we will also derive a rigorous guarantee for a specific real-world problem, namely sparse feature extraction from (proteomics-based) mass spectrometry data.

Year	Venue	Field
2016	arXiv: Machine Learning	Nonlinear system,Feature selection,Lasso (statistics),Regular polygon,Feature extraction,Convex function,Artificial intelligence,Mathematics,Machine learning,Estimator
DocType	Volume	Citations
Journal	abs/1608.08852	0
PageRank	References	Authors
0.34	0	2

Authors (2 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Martin Genzel	1	10	2.61
Gitta Kutyniok	2	325	34.77

1