Title
New malicious code detection using variable length n-grams
Abstract
Most of the commercial antivirus software fail to detect unknown and new malicious code. In order to handle this problem generic virus detection is a viable option. Generic virus detector needs features that are common to viruses. Recently Kolter et al. [16] propose an efficient generic virus detector using n-grams as features. The fixed length n-grams used there suffer from the drawback that they cannot capture meaningful sequences of different lengths. In this paper we propose a new method of variable-length n-grams extraction based on the concept of episodes and demonstrate that they outperform fixed length n-grams in malicious code detection. The proposed algorithm requires only two scans over the whole data set whereas most of the classical algorithms require scans proportional to the maximum length of n-grams.
Year
DOI
Venue
2006
10.1007/11961635_19
ICISS
Keywords
DocType
ISBN
maximum length,fixed length n-grams,new malicious code detection,variable-length n-grams extraction,different length,variable length n-grams,problem generic virus detection,length n-grams,new malicious code,malicious code detection,generic virus detector,efficient generic virus detector,data mining
Conference
3-540-68962-1
Citations 
PageRank 
References 
11
0.74
19
Authors
3
Name
Order
Citations
PageRank
D. Krishna Sandeep Reddy1622.42
Subrat Kumar Dash2433.39
Arun K. Pujari342048.20