MSAPersonality Dataset: A Modern Standard Arabic Resource for Personality Recognition
Description
This dataset contains 267 texts written in Modern Standard Arabic, annotated with the Big Five personality traits: Openness, Conscientiousness, Extraversion, Agreeableness, and Neuroticism. It was specifically designed for Automatic Personality Recognition (APR), a computational task aimed at inferring personality traits from textual or other data sources. The dataset provides a valuable resource for researchers working at the intersection of computational linguistics, psychology, and machine learning.
Files
Steps to reproduce
This dataset is described in detail in the journal article: K. Chraibi, I. Chaker, Y. Dhassi, and A. Zahi, "MSAPersonality: a Modern Standard Arabic Dataset for Personality Recognition," International Journal of Electrical and Computer Engineering, vol. 14, no. 4, pp. 4498-4507, 2024, doi: 10.11591/ijece.v14i4.pp4498-4507. Please refer to the paper for methodological details and validation.