UTeMo audiovisual dataset for emotion recognition

Published: 20 February 2024| Version 1 | DOI: 10.17632/x5rmd28h73.1
Contributors:
,
,
,

Description

UTeMo is the first database recorded with statements lexically vocalized in a Mexican variant of the Spanish language, which is specifically designed for multi-modal (audiovisual) emotion recognition (using vocal and facial expressions). It comprises 1801 video samples with a total of 105 minutes. It is composed of high quality data, as it supplies high resolution images (1920x1080 pixels at 30 fps) and high fidelity audio (sampling rate of 48KHz) files. UTeMo can be considered as a database whose number of samples is balanced according to the seven emotion classes (sadness, surprise, joy, anger, fear, disgust and neutral), so every emotional state is well represented.

Files

Institutions

Universidad Tecnologica de la Mixteca

Categories

Machine Learning, Emotion, Deep Learning, Expression Recognition

Funding

Consejo Nacional de Ciencia y Tecnología

Licence