HASSANIYA Dataset
Published: 18 February 2025 | Version 1 | DOI: 10.17632/m2swkr2bhx.1
Contributor:
Med El Moustapha El ARBY
Description
The attached file is “HASSANIYA”, the first dataset for the Mauritanian dialect. It contains two thousand records, each classified into one of three categories: positive, negative, or neutral. The records are comments posted on the Facebook platform, collected with web scraping tools and annotated with Label Studio.
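As a quick sanity check after downloading, the label distribution can be tallied with a few lines of Python. This is a minimal sketch only: it assumes the file is a CSV with a header row, and the column names `text` and `label` are hypothetical, since the file layout is not described here. The rows in `sample` are placeholders, not records from the dataset.

```python
import csv
import io
from collections import Counter

# The three categories named in the description.
EXPECTED_LABELS = {"positive", "negative", "neutral"}


def label_distribution(csv_file, label_column="label"):
    """Count records per sentiment label; raise on an unexpected label.

    `label_column` is a hypothetical column name, not taken from the dataset.
    """
    counts = Counter()
    for row in csv.DictReader(csv_file):
        # Normalise case and whitespace so "Positive " counts as "positive".
        label = row[label_column].strip().lower()
        if label not in EXPECTED_LABELS:
            raise ValueError(f"unexpected label: {label!r}")
        counts[label] += 1
    return counts


# Placeholder rows standing in for the real file (assumed layout).
sample = io.StringIO(
    "text,label\n"
    "placeholder comment 1,positive\n"
    "placeholder comment 2,negative\n"
    "placeholder comment 3,neutral\n"
    "placeholder comment 4,Positive\n"
)
counts = label_distribution(sample)
print(dict(counts))
```

With the real file, pass an open file handle instead of `sample` (for example one opened with `encoding="utf-8"` and `newline=""`); the counts should then sum to the two thousand records stated in the description.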
Files
Institutions
Universite de Nouakchott, Universite Sidi Mohamed Ben Abdallah
Categories
Natural Language Processing, Dialect, Text Mining, Sentiment Analysis