DILOF: Effective and Memory Efficient Local Outlier Detection in Data Streams. - Citegraph

Paper Info

Title
DILOF: Effective and Memory Efficient Local Outlier Detection in Data Streams.

Abstract
With precipitously growing demand to detect outliers in data streams, many studies have been conducted aiming to develop extensions of well-known outlier detection algorithm called Local Outlier Factor (LOF), for data streams. However, existing LOF-based algorithms for data streams still suffer from two inherent limitations: 1) Large amount of memory space is required. 2) A long sequence of outliers is not detected. In this paper, we propose a new outlier detection algorithm for data streams, called DILOF that effectively overcomes the limitations. To this end, we first develop a novel density-based sampling algorithm to summarize past data and then propose a new strategy for detecting a sequence of outliers. It is worth noting that our proposing algorithms do not require any prior knowledge or assumptions on data distribution. Moreover, we accelerate the execution time of DILOF about 15 times by developing a powerful distance approximation technique. Our comprehensive experiments on real-world datasets demonstrate that DILOF significantly outperforms the state-of-the-art competitors in terms of accuracy and execution time. The source code for the proposed algorithm is available at our website: http://di.postech.ac.kr/DILOF.

Year	DOI	Venue
2018	10.1145/3219819.3220022	KDD
Keywords	Field	DocType
Outlier detection,Data streams,Density-based sampling	Anomaly detection,Local outlier factor,Data mining,Distance approximation,Data stream mining,Computer science,Source code,Outlier,Execution time,Sampling (statistics),Artificial intelligence,Machine learning	Conference
ISBN	Citations	PageRank
978-1-4503-5552-0	4	0.42
References	Authors
15	3

Authors (3 rows)

Cited by (4 rows)

References (15 rows)

Name	Order	Citations	PageRank
Gyoung S. Na	1	4	1.43
Dong Hyun Kim	2	4	0.42
Hwanjo Yu	3	1715	114.02

1