Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Ensemble Learning for Fake News Detection: Enhancing Classification Accuracy and Explainability with Structural and Statistical Metadata

  • Sebastián González-Celi
  • , Henry N. Roa*
  • , Jorge Cruz-Silva
  • , Edison Loza-Aguirre
  • , Nelson Salgado-Reyes
  • , Javier Guaña-Moya
  • *Autor correspondiente de este trabajo
  • Pontificia Universidad Católica del Ecuador

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

The proliferation of fake news on digital platforms presents significant societal challenges, necessitating the development of robust and interpretable detection systems. This study proposes a stacking-based ensemble learning model that integrates XGBoost and Logistic Regression to improve fake news classification accuracy while enhancing model transparency. Unlike traditional Natural Language Processing (NLP) approaches, which rely solely on textual analysis, this model incorporates structural and statistical metadata features, such as publication date and article length, to improve generalizability across misinformation domains. Experimental results on the Spanish Political Fake News dataset demonstrate that the stacking ensemble model outperforms individual classifiers, achieving an F1-score of 95.2% and a ROC-AUC of 0.974. SHAP (Shapley Additive Explanations) analysis enhances interpretability by identifying the most influential features contributing to classification decisions, confirming that metadata plays a critical role in misinformation detection. These findings highlight the effectiveness of hybrid machine-learning approaches that combine textual and structural information for scalable misinformation detection. The study’s contributions include a highly accurate and explainable classification model, positioning ensemble learning as a viable solution for real-world applications in automated fact-checking, journalism, and social media moderation.

Idioma originalInglés
Título de la publicación alojadaIntelligent Systems and Applications - Proceedings of the 2025 Intelligent Systems Conference IntelliSys
EditoresKohei Arai
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas485-502
Número de páginas18
ISBN (versión impresa)9783031999642
DOI
EstadoPublicada - 2025
Evento11th Intelligent Systems Conference, IntelliSys 2025 - Amsterdam, Países Bajos
Duración: 28 ago. 202529 ago. 2025

Serie de la publicación

NombreLecture Notes in Networks and Systems
Volumen1554 LNNS
ISSN (versión impresa)2367-3370
ISSN (versión digital)2367-3389

Conferencia

Conferencia11th Intelligent Systems Conference, IntelliSys 2025
País/TerritorioPaíses Bajos
CiudadAmsterdam
Período28/08/2529/08/25

Huella

Profundice en los temas de investigación de 'Ensemble Learning for Fake News Detection: Enhancing Classification Accuracy and Explainability with Structural and Statistical Metadata'. En conjunto forman una huella única.

Citar esto