Abstract | ||
---|---|---|
Due to the increasing number of malware samples in the past 4 years, machine learning algorithms emerged as an important tool in automated malware detection. This approach to create the detection model requires, however, a lot of time with a continually growing data-set. Often changes in malware families and the increasing training time makes the model less efficient and increases the probability of false alarms. This paper approaches this matter by reducing the time needed to create a detection model on very large databases and suggests three different optimization techniques. First, the perceptron algorithm was adjusted to use the map-reduce paradigm in order to make it run in a distribute manner. Second, hardware specific optimizations were applied for faster mathematical computations. Finally, a cache system was used to reduce the quantity of data processed by the algorithm. Even if these methods were designed and tested for malware databases they can easily be adjusted for other databases as well. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/SYNASC.2012.33 | Symbolic and Numeric Algorithms for Scientific Computing |
Keywords | Field | DocType |
practical optimizations,perceptron algorithms,increasing number,large databases,increasing training time,detection model,perceptron algorithm,cache system,malware databases,large malware dataset,malware family,malware sample,automated malware detection,learning artificial intelligence,perceptrons | Computer science,Cache,Algorithm,Theoretical computer science,Artificial intelligence,Malware,Perceptron,Machine learning,Computation | Conference |
ISSN | ISBN | Citations |
2470-8801 | 978-1-4673-5026-6 | 1 |
PageRank | References | Authors |
0.34 | 10 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Dragos Gavrilut | 1 | 62 | 7.95 |
Razvan Benchea | 2 | 26 | 3.76 |
Cristina Vatamanu | 3 | 31 | 3.61 |