Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

A Comparative Study of Machine Learning Models for Two-Tier Android Malware Classification with Dynamic Behavioral Analysis

  • Jorge Torres
  • , Felipe Grijalva
  • , David Chushig-Muzo
  • , Luis Bote Curiel
  • , Malena Loza*
  • *Autor correspondiente de este trabajo
  • Universidad San Francisco de Quito
  • Universidad Rey Juan Carlos

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

The rapid proliferation of android malware has emerged as a critical threat to global cybersecurity. This study comparatively evaluates five supervised classification algorithms, including Random Forest (RF), Support Vector Machines (SVM) with RBF kernel, Artificial Neural Networks (ANNs), Naive Bayes and the novel TabNet model. The CCCS-CIC-AndMal-2020 dataset is used that comprises 200,000 malware samples categorized into 14 classes and 191 families, with features dynamically extracted during application execution in emulated environments. The predictive performance was assessed at two hierarchical classification approaches, distinguishing between broad malware categories and family-level attribution. To address class imbalance, oversampling techniques were considered. Precision, recall, and F1-score metrics, complemented by confusion matrices and ROC curves, were utilized for comprehensive evaluation. Statistical significance of differences among classifiers was determined using Friedman and Nemenyi post-hoc tests. Experimental results showed that RF, SVM, and ANNs consistently outperform other models across most metrics. This research provides a robust analytical framework for developing intelligent malware detection systems, contributing significantly to enhanced mobile cybersecurity.

Idioma originalInglés
Título de la publicación alojadaIntelligent Data Engineering and Automated Learning, IDEAL 2025 - 26th International Conference, Proceedings
EditoresLuis Martínez, David Camacho, Hujun Yin, Bapi Dutta, Raciel Yera, Rosa M. Rodríguez Domínguez, Antonio Tallón-Ballesteros
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas445-456
Número de páginas12
ISBN (versión impresa)9783032104854
DOI
EstadoPublicada - 2026
Evento26th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2025 - Jaén, Espana
Duración: 13 nov. 202515 nov. 2025

Serie de la publicación

NombreLecture Notes in Computer Science
Volumen16238 LNCS
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia26th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2025
País/TerritorioEspana
CiudadJaén
Período13/11/2515/11/25

Huella

Profundice en los temas de investigación de 'A Comparative Study of Machine Learning Models for Two-Tier Android Malware Classification with Dynamic Behavioral Analysis'. En conjunto forman una huella única.

Citar esto