Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Predicting MOOC Student Success Using Aggregated Behavioral Features and Machine Learning Under the CRISP-ML(Q) Framework

  • María Belén Toledo
  • , Henry N. Roa*
  • , Edison Loza-Aguirre
  • , Eduardo Espinosa-Avila
  • *Autor correspondiente de este trabajo
  • Pontificia Universidad Católica del Ecuador
  • University of San Francisco
  • Universidad Nacional Autónoma de México

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

This study examines the application of machine learning to predict academic performance in Massive Open Online Courses (MOOCs), aiming to address the persistent challenge of low completion rates despite high enrollment rates. Identifying at-risk students early in the course is crucial for enabling targeted interventions to improve engagement and retention. The research is essential because it offers a practical and scalable approach for early detection using only behavioral data readily available during the initial stages of learning. The study utilized data from a Coursera MOOC, comprising over 8,000 anonymized student records and more than 80 event log files. Aggregated behavioral features—such as total sessions, number of quizzes completed, lecture views, supplementary activity access, and average session duration—were extracted from the first 50% of course participation. Following the CRISP-ML(Q) framework, three predictive models were developed and compared: logistic regression, decision trees, and support vector machines. The results showed that the decision tree classifier achieved the highest performance, with 90.9% accuracy and balanced precision and recall. These findings suggest that early-stage, aggregated behavioral metrics can predict course outcomes without relying on complex sequence modeling. The implications are significant for MOOC providers and educational institutions seeking real-time, interpretable, and scalable solutions to support student success. The proposed model can guide early interventions and improve overall course completion rates in online learning environments.

Idioma originalInglés
Título de la publicación alojadaProceedings of the Future Technologies Conference, FTC 2025, Volume 3
EditoresKohei Arai
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas226-242
Número de páginas17
ISBN (versión impresa)9783032079947
DOI
EstadoPublicada - 2026
Publicado de forma externa
EventoFuture Technologies Conference, FTC 2025 - Munich, Alemania
Duración: 6 nov. 20257 nov. 2025

Serie de la publicación

NombreLecture Notes in Networks and Systems
Volumen1677 LNNS
ISSN (versión impresa)2367-3370
ISSN (versión digital)2367-3389

Conferencia

ConferenciaFuture Technologies Conference, FTC 2025
País/TerritorioAlemania
CiudadMunich
Período6/11/257/11/25

Huella

Profundice en los temas de investigación de 'Predicting MOOC Student Success Using Aggregated Behavioral Features and Machine Learning Under the CRISP-ML(Q) Framework'. En conjunto forman una huella única.

Citar esto