Title
DILOF: Effective and Memory Efficient Local Outlier Detection in Data Streams.
Abstract
With precipitously growing demand to detect outliers in data streams, many studies have been conducted aiming to develop extensions of well-known outlier detection algorithm called Local Outlier Factor (LOF), for data streams. However, existing LOF-based algorithms for data streams still suffer from two inherent limitations: 1) Large amount of memory space is required. 2) A long sequence of outliers is not detected. In this paper, we propose a new outlier detection algorithm for data streams, called DILOF that effectively overcomes the limitations. To this end, we first develop a novel density-based sampling algorithm to summarize past data and then propose a new strategy for detecting a sequence of outliers. It is worth noting that our proposing algorithms do not require any prior knowledge or assumptions on data distribution. Moreover, we accelerate the execution time of DILOF about 15 times by developing a powerful distance approximation technique. Our comprehensive experiments on real-world datasets demonstrate that DILOF significantly outperforms the state-of-the-art competitors in terms of accuracy and execution time. The source code for the proposed algorithm is available at our website: http://di.postech.ac.kr/DILOF.
Year
DOI
Venue
2018
10.1145/3219819.3220022
KDD
Keywords
Field
DocType
Outlier detection,Data streams,Density-based sampling
Anomaly detection,Local outlier factor,Data mining,Distance approximation,Data stream mining,Computer science,Source code,Outlier,Execution time,Sampling (statistics),Artificial intelligence,Machine learning
Conference
ISBN
Citations 
PageRank 
978-1-4503-5552-0
4
0.42
References 
Authors
15
3
Name
Order
Citations
PageRank
Gyoung S. Na141.43
Dong Hyun Kim240.42
Hwanjo Yu31715114.02