MSAPersonality Dataset: A Modern Standard Arabic Resource for Personality Recognition

Published: 26 November 2024| Version 1 | DOI: 10.17632/k9b68trxj5.1
Contributors:
Khaoula Chraibi, Ilham Chaker, Azeddine Zahi

Description

This dataset contains 267 texts written in Modern Standard Arabic, annotated with the Big Five personality traits: Openness, Conscientiousness, Extraversion, Agreeableness, and Neuroticism. It was specifically designed for Automatic Personality Recognition (APR), a computational task aimed at inferring personality traits from textual or other data sources. The dataset provides a valuable resource for researchers working at the intersection of computational linguistics, psychology, and machine learning.

Files

Steps to reproduce

This dataset is described in detail in the journal article: K. Chraibi, I. Chaker, Y. Dhassi, and A. Zahi, "MSAPersonality: a Modern Standard Arabic Dataset for Personality Recognition," International Journal of Electrical and Computer Engineering, vol. 14, no. 4, pp. 4498-4507, 2024, doi: 10.11591/ijece.v14i4.pp4498-4507. Please refer to the paper for methodological details and validation.

Institutions

Universite Sidi Mohamed Ben Abdellah Faculte des Sciences et Techniques de Fes, Universite Sidi Mohamed Ben Abdallah

Categories

Computer Science, Natural Language Processing, Machine Learning, Personality

Licence