An approach to detecting text autorship in the Spanish language

Mauricio Iturralde, Roberto Maldonado, Daniel Fellig

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

Authors tend to express themselves using language in ways that reflect particular styles, vocabularies, biases, idioms, etc. These features can be captured in the so-called firm or stylone. Although capturing these attributes with high fidelity has proven to be very challenging, some advances have been made. Stylometry is the analysis of the unique attributes that are expressed by an author unconsciously through his or her publications. In this paper we investigate techniques for the detection of authorship patterns from the text content of a large number of digital documents, including e-mails, academic notes and free redaction in the Spanish language. A mechanism based on pondering parameters, including statistical observations, extracting a pattern is proposed. We defined 150 stylistic criteria parameters adapted to the Spanish language to compute our metric. Extensive experiment results are also presented.

Idioma originalInglés
Título de la publicación alojada2016 8th IFIP International Conference on New Technologies, Mobility and Security, NTMS 2016
EditoresMohamad Badra, Giovanni Pau, Vasos Vassiliou
EditorialInstitute of Electrical and Electronics Engineers Inc.
ISBN (versión digital)9781509029143
DOI
EstadoPublicada - 20 dic. 2016
Evento8th IFIP International Conference on New Technologies, Mobility and Security, NTMS 2016 - Larnaca, Chipre
Duración: 21 nov. 201623 nov. 2016

Serie de la publicación

Nombre2016 8th IFIP International Conference on New Technologies, Mobility and Security, NTMS 2016

Conferencia

Conferencia8th IFIP International Conference on New Technologies, Mobility and Security, NTMS 2016
País/TerritorioChipre
CiudadLarnaca
Período21/11/1623/11/16

Huella

Profundice en los temas de investigación de 'An approach to detecting text autorship in the Spanish language'. En conjunto forman una huella única.

Citar esto