Title
MONK - Outlier-Robust Mean Embedding Estimation by Median-of-Means.
Abstract
Mean embeddings provide an extremely flexible and powerful tool in machine learning and statistics to represent probability distributions and define a semi-metric (MMD, maximum mean discrepancy ; also called N-distance or energy distance), with numerous successful applications. The representation is constructed as the expectation of the feature map defined by a kernel. As a mean, its classical empirical estimator, however, can be arbitrary severely affected even by a single outlier in case of unbounded features. To the best of our knowledge, unfortunately even the consistency of the existing few techniques trying to alleviate this serious sensitivity bottleneck is unknown. In this paper, we show how the recently emerged principle of median-of-means can be used to design minimax-optimal estimators for kernel mean embedding and MMD, with finite-sample strong outlier-robustness guarantees.
Year
Venue
Field
2018
arXiv: Machine Learning
Maximum mean discrepancy,Kernel (linear algebra),Bottleneck,Mathematical optimization,Embedding,Algorithm,Outlier,Probability distribution,Energy distance,Mathematics,Estimator
DocType
Volume
Citations 
Journal
abs/1802.04784
0
PageRank 
References 
Authors
0.34
17
5
Name
Order
Citations
PageRank
Matthieu Lerasle142.49
Zoltán Szabó2809.15
Guillaume Lecue3172.50
Gaspar Massiot400.34
Eric Moulines512312.48