HASSANIYA Dataset

Published: 18 February 2025 | Version 1 | DOI: 10.17632/m2swkr2bhx.1
Contributor:
Med El Moustapha El ARBY

Description

The attached file contains the first dataset for the Mauritanian dialect, called “HASSANIYA”. It holds two thousand records, each classified into one of three categories: positive, negative, or neutral. The records were collected with web scraping tools from comments posted on Facebook, and each record was annotated using Label Studio.
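
As a quick sanity check on a three-class sentiment dataset like this one, it can help to count the records per category. The following is a minimal sketch only: the file layout is not described here, so the CSV format, the `text` and `label` column names, and the sample rows are all assumptions for illustration, not properties of the published file.

```python
import csv
import io
from collections import Counter

# The three sentiment categories named in the description.
LABELS = ("positive", "negative", "neutral")


def label_distribution(csv_text, label_column="label"):
    """Count records per sentiment category in CSV text.

    `label_column` is a hypothetical column name; adjust it to the real file.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    # Start every category at zero so absent classes still show up.
    counts = Counter({name: 0 for name in LABELS})
    for row in reader:
        label = (row.get(label_column) or "").strip().lower()
        if label not in LABELS:
            raise ValueError(f"unexpected label: {label!r}")
        counts[label] += 1
    return dict(counts)


# Placeholder rows for illustration only; not taken from the dataset.
sample = (
    "text,label\n"
    "example one,positive\n"
    "example two,negative\n"
    "example three,neutral\n"
    "example four,positive\n"
)
print(label_distribution(sample))
```

To run this on the real file, one would read its contents into a string and pass the actual label column name. A strongly skewed distribution would suggest using stratified splits when training a classifier.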

Institutions

  • Universite de Nouakchott
  • Universite Sidi Mohamed Ben Abdallah

Categories

Natural Language Processing, Dialect, Text Mining, Sentiment Analysis

Licence