Resumen
Content analysis of political manifestos is necessary to understand the policies and proposed actions of a party. However, manually labeling political texts is time-consuming and labor-intensive. Transformer networks have become essential tools for automating this task. Nevertheless, these models require extensive datasets to achieve good performance. This can be a limitation in manifesto classification, where the availability of publicly labeled datasets can be scarce. To address this challenge, in this work, we developed a Transformer network for the classification of manifestos using a cross-domain training strategy. Using the database of the Comparative Manifesto Project, we implemented a fractional factorial experimental design to determine which Spanish-written manifestos form the best training set for Ecuadorian manifesto labeling. Furthermore, we statistically analyzed which Transformer architecture and preprocessing operations improve the model accuracy. The results indicate that creating a training set with manifestos from Spain and Uruguay, along with implementing stemming and lemmatization preprocessing operations, produces the highest classification accuracy. In addition, we found that the DistilBERT and RoBERTa transformer networks perform statistically similarly and consistently well in manifesto classification. Using the cross-context training strategy, DistilBERT and RoBERTa achieve 60.05% and 57.64% accuracy, respectively, in the classification of the Ecuadorian manifesto. Finally, we investigated the effect of the composition of the training set on performance. The experiments demonstrate that training DistilBERT solely with Ecuadorian manifestos achieves the highest accuracy and F1-score. Furthermore, in the absence of the Ecuadorian dataset, competitive performance is achieved by training the model with datasets from Spain and Uruguay.
| Idioma original | Inglés |
|---|---|
| Páginas (desde-hasta) | 578-603 |
| Número de páginas | 26 |
| Publicación | Social Science Computer Review |
| Volumen | 43 |
| N.º | 3 |
| DOI | |
| Estado | Publicada - jun. 2025 |
Huella
Profundice en los temas de investigación de 'A Transformer Model for Manifesto Classification Using Cross-Context Training: An Ecuadorian Case Study'. En conjunto forman una huella única.Prensa/Medios de comunicación
-
Universidad San Francisco de Quito Researcher Updates Current Data on Social Science and Computers (A Transformer Model for Manifesto Classification Using Cross-Context Training: An Ecuadorian Case Study)
Riofrío, D., Benítez, D., Baldeón Calisto, M., Navarrete, D., Flores Moyano, R., Medina Pérez, P. & Pérez, N.
9/08/24
1 elemento de Cobertura del medio de comunicación
Prensa/medios de comunicación
Citar esto
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver