Skip to main navigation Skip to search Skip to main content

Ensemble Learning for Fake News Detection: Enhancing Classification Accuracy and Explainability with Structural and Statistical Metadata

  • Sebastián González-Celi
  • , Henry N. Roa*
  • , Jorge Cruz-Silva
  • , Edison Loza-Aguirre
  • , Nelson Salgado-Reyes
  • , Javier Guaña-Moya
  • *Corresponding author for this work
  • Pontificia Universidad Católica del Ecuador

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

The proliferation of fake news on digital platforms presents significant societal challenges, necessitating the development of robust and interpretable detection systems. This study proposes a stacking-based ensemble learning model that integrates XGBoost and Logistic Regression to improve fake news classification accuracy while enhancing model transparency. Unlike traditional Natural Language Processing (NLP) approaches, which rely solely on textual analysis, this model incorporates structural and statistical metadata features, such as publication date and article length, to improve generalizability across misinformation domains. Experimental results on the Spanish Political Fake News dataset demonstrate that the stacking ensemble model outperforms individual classifiers, achieving an F1-score of 95.2% and a ROC-AUC of 0.974. SHAP (Shapley Additive Explanations) analysis enhances interpretability by identifying the most influential features contributing to classification decisions, confirming that metadata plays a critical role in misinformation detection. These findings highlight the effectiveness of hybrid machine-learning approaches that combine textual and structural information for scalable misinformation detection. The study’s contributions include a highly accurate and explainable classification model, positioning ensemble learning as a viable solution for real-world applications in automated fact-checking, journalism, and social media moderation.

Original languageEnglish
Title of host publicationIntelligent Systems and Applications - Proceedings of the 2025 Intelligent Systems Conference IntelliSys
EditorsKohei Arai
PublisherSpringer Science and Business Media Deutschland GmbH
Pages485-502
Number of pages18
ISBN (Print)9783031999642
DOIs
StatePublished - 2025
Event11th Intelligent Systems Conference, IntelliSys 2025 - Amsterdam, Netherlands
Duration: 28 Aug 202529 Aug 2025

Publication series

NameLecture Notes in Networks and Systems
Volume1554 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference11th Intelligent Systems Conference, IntelliSys 2025
Country/TerritoryNetherlands
CityAmsterdam
Period28/08/2529/08/25

Keywords

  • Fake news detection
  • Machine learning
  • Metadata-driven classification
  • SHAP analysis
  • Stacking ensemble learning
  • XGBoost

Fingerprint

Dive into the research topics of 'Ensemble Learning for Fake News Detection: Enhancing Classification Accuracy and Explainability with Structural and Statistical Metadata'. Together they form a unique fingerprint.

Cite this