Ensemble Malware Classification Using Neural Networks

Piotr Wyrwiński; Jakub Dutkiewicz; Czesław Jędrzejek

doi:10.1007/978-3-030-59000-0_10

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / Ensemble Malware Classification Using Neural Networks

Zgłoś uwagę

Rozdział

Pobierz BibTeX

Tytuł

Ensemble Malware Classification Using Neural Networks

Autorzy

Piotr Wyrwiński (WIiT) ^{[ 1 ][ S ]}
Jakub Dutkiewicz (WIiT) ^{[ 1 ][ 2.3 ][ P ]}
Czesław Jędrzejek (WIiT) ^{[ 1 ][ 2.3 ][ P ]}

^{[ 1 ]} Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | ^{[ S ]} student | ^{[ P ]} pracownik

Dyscyplina naukowa (Ustawa 2.0)

[2.3] Informatyka techniczna i telekomunikacja

Rok publikacji

2020

Typ rozdziału

rozdział w monografii naukowej / referat

Język publikacji

angielski

Słowa kluczowe

EN

Malware detection
Microsoft Malware Classification Challenge
Malware neural networks

Streszczenie

EN This work presents an experimental study of malware classification using the Microsoft Malware Classification Challenge 2015 dataset. We combine the approach of the winning solution to the Microsoft Malware Classification Challenge with the neural network approach. Using a combination of n-grams features for both assembly (asm) and byte code enables us to significantly improve the result. By mixing multiple approaches, we are able to get the best log-loss result of 0.0025, so far. This comes mostly from the classical XGBoost method with n-gram contributions from the binary and assembly code. However, understanding this result is still incomplete. The standard neural network approaches (even with LSTM) alone give poorer results compared to the XGBoost, based on mostly n-gram. It is not clear why adding 6-grams to the binary code analysis does not improve results. There are many more options to be tested in the future, in particular networks.

Strony (od-do)

125 - 138

DOI

10.1007/978-3-030-59000-0_10

URL

https://link.springer.com/chapter/10.1007/978-3-030-59000-0_10

Książka

Multimedia Communications, Services and Security : 10th International Conference, MCSS 2020, Kraków, Poland, October 8-9, 2020, Proceedings

Zaprezentowany na

10th International Conference on Multimedia Communications, Services and Security, MCSS 2020, 8-9.10.2020, Kraków, Polska

Typ licencji

inne