Command Palette
Search for a command to run...
Classification of Malware by Using Structural Entropy on Convolutional Neural Networks
{Ramon Vicens Jordi Planes Carles Mateu Daniel Gibert}
Abstract
he number of malicious programs has grown both in number and in sophistication. Analyzing the malicious intent ofvast amounts of data requires huge resources and thus, effective categorization of malware is required. In this paper,the content of a malicious program is represented as an entropy stream, where each value describes the amount of entropy of a small chunk of code in a specific location of the file. Wavelet transforms are then applied to this entropy signal todescribe the variation in the entropic energy. Motivated by the visual similarity between streams of entropy of malicioussoftware belonging to the same family, we propose a file agnostic deep learning approach for categorization of malware.Our method exploits the fact that most variants are generated by using common obfuscation techniques and that compression and encryption algorithms retain some properties present in the original code. This allows us to find discriminative patterns that almost all variants in a family share. Our method has been evaluated using the data provided by Microsoft for the BigData Innovators Gathering Anti-Malware Prediction Challenge, and achieved promising results in comparison with the State of the Art.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| malware-classification-on-microsoft-malware | Structural entropy CNN | Accuracy (10-fold): 0.9708 LogLoss: 0.134624 Macro F1 (10-fold): 0.9314 |
| malware-classification-on-microsoft-malware | Multiresolution CNN | Accuracy (10-fold): 0.9828 LogLoss: 0.124431 Macro F1 (10-fold): 0.9636 |
| malware-classification-on-microsoft-malware | Multiresolution CNN + Bagging | LogLoss: 0.075081 |
| malware-classification-on-microsoft-malware | Dynamic Time Wrapping + K-NN | Accuracy (10-fold): 0.9894 LogLoss: 0.367724 Macro F1 (10-fold): 0.9813 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.