Hunting Android Malware Using Multimodal Deep Learning and Hybrid Analysis Data

Título: Hunting Android Malware Using Multimodal Deep Learning and Hybrid Analysis Data

Autores: Angelo Oliveira and Renato Sassi.

Resumo:
In this work, we propose a new multimodal Deep Learning (DL) Android malware detection method, Chimera, that combines both manual and automatic feature engineering by using the DL architectures, Convolutional Neural Networks (CNN), Deep Neural Networks (DNN), and Transformer Networks (TN) to perform feature learning from raw data (Dalvik Executables (DEX)), static analysis data (Android Intents & Permissions), and dynamic analysis data (system call sequences) respectively. To train and evaluate our model, we implemented the Knowledge Discovery in Databases (KDD) process and used the publicly available Android benchmark dataset Omnidroid. By leveraging a hybrid source of information to learn high-level feature representations for both the static and dynamic properties of Android applications, Chimera’s detection Accuracy, Precision, and Recall outperform classical Machine Learning (ML) algorithms, state-of-the-art Ensemble, and Voting Ensembles ML methods, as well as unimodal DL methods using CNNs, DNNs, TNs, and Long-Short Term Memory Networks (LSTM). To the best of our knowledge, this is the first work that successfully applies multimodal DL to combine those three different modalities of data using DNNs, CNNs, and TNs to learn a shared representation that can be used in Android malware detection tasks.

Palavras-chave:
Android Malware Detection, Computer Security, Multimodal Deep Learning.

Páginas: 10

Código DOI: 10.21528/CBIC2021-32

Artigo em pdf: CBIC_2021_paper_32.pdf

Arquivo BibTeX: CBIC_2021_32.bib