Title
IP2Vec: Learning Similarities Between IP Addresses.
Abstract
IP Addresses are a central part of packet- and flow-based network data. However, visualization and similarity computation of IP Addresses are challenging to due the missing natural order. This paper presents a novel similarity measure IP2Vec for IP Addresses that builds on ideas from Word2Vec, a popular approach in text mining. The key idea is to learn similarities by extracting available context information from network data. IP Addresses are similar if they appear in similar contexts. Thus, IP2Vec is automatically derived from the given network data set. The proposed approach is evaluated experimentally on two public flow-based data sets. In particular, we demonstrate the effectiveness of clustering IP Addresses within a botnet data set. In addition, we use visualization methods to analyse the learned similarities in more detail. These experiments indicate that IP2Vec is well suited to capture the similarity of IP Addresses based on their network communications.
Year
Venue
Field
2017
ICDM Workshops
Data mining,Data set,Similarity measure,Botnet,Computer science,Visualization,Network packet,Feature extraction,Word2vec,Cluster analysis
DocType
Citations 
PageRank 
Conference
3
0.39
References 
Authors
24
4
Name
Order
Citations
PageRank
Markus Ring1343.16
Alexander Dallmann283.17
Dieter Landes315928.78
Andreas Hotho43232210.84