UTeMo audiovisual dataset for emotion recognition
Published: 20 February 2024 | Version 1 | DOI: 10.17632/x5rmd28h73.1
Contributors:
Description
UTeMo is the first database of statements spoken in a Mexican variant of Spanish that is specifically designed for multimodal (audiovisual) emotion recognition from vocal and facial expressions. It comprises 1801 video samples totaling 105 minutes. The data are high quality: video is supplied at high resolution (1920x1080 pixels at 30 fps) and audio at high fidelity (48 kHz sampling rate). The number of samples is balanced across the seven emotion classes (sadness, surprise, joy, anger, fear, disgust, and neutral), so every emotional state is well represented.
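The figures quoted in the description imply some rough per-clip numbers that are useful when sizing model inputs. The sketch below only does that arithmetic; it does not read the dataset. The constant names and the `summarize` helper are illustrative assumptions, and the 105-minute total is presumably rounded, so the results should be read as approximate.

```python
# Back-of-the-envelope statistics derived only from the figures quoted in the
# dataset description. Names are illustrative, not part of the dataset.
NUM_SAMPLES = 1801        # video samples
TOTAL_MINUTES = 105       # total duration (likely rounded)
FPS = 30                  # video frame rate
AUDIO_RATE_HZ = 48_000    # audio sampling rate
EMOTIONS = ["sadness", "surprise", "joy", "anger", "fear", "disgust", "neutral"]


def summarize():
    """Return approximate per-clip and per-class averages."""
    mean_seconds = TOTAL_MINUTES * 60 / NUM_SAMPLES
    return {
        "mean_clip_seconds": mean_seconds,
        "mean_frames_per_clip": mean_seconds * FPS,
        "mean_audio_samples_per_clip": mean_seconds * AUDIO_RATE_HZ,
        # 1801 is not divisible by 7, so the balance can only be approximate.
        "samples_per_class_if_uniform": NUM_SAMPLES / len(EMOTIONS),
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value:.1f}")
```

Under these assumptions the average clip lasts about 3.5 seconds, and an even split would give roughly 257 samples per emotion class. The actual per-class counts should be taken from the dataset files themselves.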
Institutions
Universidad Tecnológica de la Mixteca
Categories
Machine Learning, Emotion, Deep Learning, Expression Recognition
Funding
Consejo Nacional de Ciencia y Tecnología