A Zoom into Ecuadorian Politics: Manifesto Text Classification using NLP

Fernanda Barzallo, María Emilia Moscoso, Margorie Pérez, María Baldeon-Calisto, Danny Navarrete, Daniel Riofrío, Pablo Medina-Pérez, Susana K. Lai-Yuen

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Political science research on party manifestos helps to understand a candidate’s strategies and proposed actions during electoral campaigns. To achieve this, political scientists classify a manifesto’s sentences and quasi-sentences into seven main domains established by the Comparative Manifesto Project. However, manually coding is a time-consuming and labor-intensive task that can lead to biases. Automatically classifying manifestos has shown to produce good and reproducible annotations. In Ecuador, research on automatic manifesto analysis has been limited. Moreover, there is no large labeled Ecuadorian corpus available for training. Therefore, in this work we develop a Transformer network for automatically analyzing Ecuadorian manifestos using a cross-domain training approach. We implement a fractional factorial experimental design to determine which Transformer model, type of pre-processing operations, and Spanish text data should be used to maximize the accuracy of the classification model. The results show that the DistilBERT architecture trained with Mexico’s and Argentina´s manifestos increase the classification accuracy. Without using an Ecuadorian corpus for training, the implemented DistilBERT achieves a 44% accuracy on the Ecuadorian test set, which has a comparable performance to other models in literature trained with a Spanish corpus.

Idioma originalInglés
Título de la publicación alojada2023 IEEE 13th International Conference on Pattern Recognition Systems, ICPRS 2023
EditorialInstitute of Electrical and Electronics Engineers Inc.
ISBN (versión digital)9798350333374
DOI
EstadoPublicada - 4 jul. 2023
Evento13th IEEE International Conference on Pattern Recognition Systems, ICPRS 2023 - Guayaquil, Ecuador
Duración: 4 jul. 20237 jul. 2023

Serie de la publicación

Nombre2023 IEEE 13th International Conference on Pattern Recognition Systems (ICPRS)

Conferencia

Conferencia13th IEEE International Conference on Pattern Recognition Systems, ICPRS 2023
País/TerritorioEcuador
CiudadGuayaquil
Período4/07/237/07/23

Huella

Profundice en los temas de investigación de 'A Zoom into Ecuadorian Politics: Manifesto Text Classification using NLP'. En conjunto forman una huella única.

Citar esto