Abstract | ||
---|---|---|
Big Data systems are producing huge amounts of data in real-time. Finding anomalies in these systems is becoming increasingly important, since it can help to reduce the number of failures, and improve the mean time of recovery. In this work, we present EXAD, a new prototype system for explainable anomaly detection, in particular for detecting and explaining anomalies in time-series data obtained from traces of Apache Spark jobs. Apache Spark has become the most popular software tool for processing Big Data. The new system contains the most wellknown approaches to anomaly detection, and a novel generator of artificial traces, that can help the user to understand the different performances of the different methodologies. In this demo, we will show how this new framework works, and how users can benefit of detecting anomalies in an efficient and fast way when dealing with traces of jobs of Big Data systems. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/ICDMW.2018.00204 | 2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW) |
Keywords | Field | DocType |
anomaly detection, machine learning, Spark | Software tool,Data mining,Anomaly detection,Spark (mathematics),Computer science,Feature extraction,Artificial intelligence,Big data,Computer cluster,Machine learning | Conference |
ISSN | Citations | PageRank |
2375-9232 | 1 | 0.35 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fei Song | 1 | 41 | 8.63 |
Yanlei Diao | 2 | 2234 | 108.95 |
Jesse Read | 3 | 34 | 4.67 |
Arnaud Stiegler | 4 | 1 | 1.70 |
Albert Bifet | 5 | 2659 | 140.83 |