HASSANIYA Dataset

Published: 18 February 2025 | Version 1 | DOI: 10.17632/m2swkr2bhx.1
Contributor:
Med El Moustapha El ARBY

Description

The attached file is "HASSANIYA", the first dataset for the Mauritanian dialect. It contains two thousand records, each classified into one of three sentiment categories: positive, negative, or neutral. The records were collected with web scraping tools from comments posted on Facebook, and each record was annotated using Label Studio.
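
A quick sanity check after downloading a dataset like this is to confirm that every record carries one of the three categories and to see how the records are spread across them. The following is a minimal sketch only: it assumes the data has been exported as CSV with a column named `label`, which this page does not confirm, and the function name `label_distribution` and the sample rows are hypothetical placeholders, not records from the dataset.

```python
import csv
import io
from collections import Counter

# The three categories named in the description.
EXPECTED_LABELS = {"positive", "negative", "neutral"}


def label_distribution(csv_file, label_column="label"):
    """Count records per category in an open CSV file.

    The column name "label" is an assumption; adjust it to match
    the actual file. Raises ValueError on an unexpected category.
    """
    counts = Counter()
    for row in csv.DictReader(csv_file):
        label = row[label_column].strip().lower()
        if label not in EXPECTED_LABELS:
            raise ValueError(f"unexpected label: {label!r}")
        counts[label] += 1
    return counts


# Placeholder rows for illustration only (not taken from the dataset).
sample = io.StringIO(
    "text,label\n"
    "placeholder comment 1,positive\n"
    "placeholder comment 2,negative\n"
    "placeholder comment 3,neutral\n"
    "placeholder comment 4,Positive\n"
)
counts = label_distribution(sample)
print(dict(counts))
```

With the real file, the same function could be called on `open(path, newline="", encoding="utf-8")`, where `path` is wherever the download was saved; the counts should then sum to the two thousand records stated above.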

Institutions

Universite de Nouakchott, Universite Sidi Mohamed Ben Abdallah

Categories

Natural Language Processing, Dialect, Text Mining, Sentiment Analysis

Licence