Abstract | ||
---|---|---|
We propose a semi-supervised approach towards anomaly detection in multivariate categorical data. Our goal is to learn a model that can distinguish the anomalous data, given a small set of training data from the normal class. To this end, our approach learns the probability distribution of normal instances with the assumption that the categorical data are generated from a continuous latent space. Gaussian process is adopted to construct the generative model. As a non-parametric Bayesian model, Gaussian process can adapt its model complexity according to the data size. Hence, our approach can be effective when the training dataset is small. Comprehensive experiments over different benchmarks clearly demonstrate the effectiveness of our approach. (c) 2021 Elsevier B.V. All rights reserved. |
Year | DOI | Venue |
---|---|---|
2021 | 10.1016/j.knosys.2021.106896 | KNOWLEDGE-BASED SYSTEMS |
Keywords | DocType | Volume |
Anomaly detection, Categorical data, Gaussian process, Data-efficient learning | Journal | 220 |
ISSN | Citations | PageRank |
0950-7051 | 0 | 0.34 |
References | Authors | |
0 | 6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fengmao Lv | 1 | 1 | 0.68 |
Tao Liang | 2 | 26 | 6.42 |
Jiayi Zhao | 3 | 0 | 1.01 |
Zhongliu Zhuo | 4 | 0 | 1.35 |
Jinzhao Wu | 5 | 8 | 4.99 |
Guowu Yang | 6 | 20 | 6.39 |