Title
Outlier Detection Based On Low Density Models
Abstract
Most outlier detection algorithms are based on lazy learning or imply quadratic complexity. Both characteristics make them unsuitable for big data and stream data applications and preclude their applicability in systems that must operate autonomously. In this paper we propose a new algorithm-called SDO (Sparse Data Observers)-to estimate outlierness based on low density models of data. SDO is an eager learner; therefore, computational costs in application phases are severely reduced. We perform tests with a wide variation of synthetic datasets as well as the main datasets published in the literature for anomaly detection testing. Results show that SDO satisfactorily competes with the best ranked outlier detection alternatives. The good detection performance coupled with a low complexity makes SDO highly flexible and adaptable to stand-alone frameworks that must detect outliers fast with accuracy rates equivalent to lazy learning algorithms.
Year
DOI
Venue
2018
10.1109/ICDMW.2018.00140
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW)
Field
DocType
ISSN
Data modeling,Anomaly detection,Data mining,Ranking,Computer science,Lazy learning,Outlier,Artificial intelligence,Cluster analysis,Big data,Sparse matrix,Machine learning
Conference
2375-9232
Citations 
PageRank 
References 
1
0.35
0
Authors
3
Name
Order
Citations
PageRank
Félix Iglesias Vázquez1637.10
Tanja Zseby219936.35
Arthur Zimek3222893.16