HASSANIYA Dataset
Published: 18 February 2025| Version 1 | DOI: 10.17632/m2swkr2bhx.1
Contributor:
Med El Moustapha El ARBYDescription
The attached file is the first Mauritanian dialect dataset called “HASSANIYA” containing two thousand records classified into three categories: positive, negative and neutral. This dataset was collected using web scraping tools from comments posted on the Facebook platform, and Label Studio was used to annotate each record.
Files
Institutions
- Universite de Nouakchott
- Universite Sidi Mohamed Ben Abdallah
Categories
Natural Language Processing, Dialect, Text Mining, Sentiment Analysis