Title
Learning Explainable Representations of Malware Behavior
Abstract
We address the problems of identifying malware in network telemetry logs and providing indicators of compromise-comprehensible explanations of behavioral patterns that identify the threat. In our system, an array of specialized detectors abstracts network-flow data into comprehensible network events in a first step. We develop a neural network that processes this sequence of events and identifies specific threats, malware families and broad categories of malware. We then use the integrated-gradients method to highlight events that jointly constitute the characteristic behavioral pattern of the threat. We compare network architectures based on CNNs, LSTMs, and transformers, and explore the efficacy of unsupervised pre-training experimentally on large-scale telemetry data. We demonstrate how this system detects njRAT and other malware based on behavioral patterns.
Year
DOI
Venue
2021
10.1007/978-3-030-86514-6_4
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: APPLIED DATA SCIENCE TRACK, PT IV
Keywords
DocType
Volume
Neural networks, Malware detection, Sequence models, Unsupervised pre-training
Conference
12978
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
0
6
Name
Order
Citations
PageRank
Paul Prasse1133.45
Jan Brabec200.34
jan kohout3325.31
Martin Kopp400.34
Lukas Bajer500.34
Tobias Scheffer61862139.64