Skip to main navigation Skip to search Skip to main content

End-to-End Canine Emotion Recognition from Images to Video: EfficientNetV2-S and Visual Interpretability Study

  • Lucía Montaluisa*
  • , Roberto Andrade
  • , Felipe Grijalva
  • *Corresponding author for this work
  • Universidad San Francisco de Quito

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study presents a system for the automatic classification ofdoge motions-happiness, s adness, a nger, and relaxation-using the EfficientNetV2-S a rchitecture, t rained on images and adapted for video inference. The model was trained on the public Dog Emotion dataset from Kaggle, employing transfer learning with ImageNet1K_V1 weights, combined with data augmentation techniques and stratified cross-validation. In addition, Grad-CAM was integrated as a visual explainability tool, enabling the identification of r elevant a natomical regions associated with each emotion. The system achieved an 85.43% accuracy on the test set, representing the best performance reported to date on this dataset, surpassing the results presented in previous state-of-theart studies. These results, combined with strong consistency in the visual activations, validate the potential of this approach as an efficient and explainable tool for canine emotional analysis in real-world contexts.

Original languageEnglish
Title of host publicationETCM 2025 - 9th Ecuador Technical Chapters Meeting
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331552640
DOIs
StatePublished - 2025
Event9th Ecuador Technical Chapters Meeting, ETCM 2025 - Quito, Ecuador
Duration: 21 Oct 202524 Oct 2025

Publication series

NameETCM 2025 - 9th Ecuador Technical Chapters Meeting

Conference

Conference9th Ecuador Technical Chapters Meeting, ETCM 2025
Country/TerritoryEcuador
CityQuito
Period21/10/2524/10/25

Keywords

  • Emotion Recognition
  • Grad-CAM
  • Visual Interpretability

Fingerprint

Dive into the research topics of 'End-to-End Canine Emotion Recognition from Images to Video: EfficientNetV2-S and Visual Interpretability Study'. Together they form a unique fingerprint.

Cite this