MultilingualTweetEval2024

Published: 30 December 2024 | Version 1 | DOI: 10.17632/8x3fzj5t3c.1
Contributors: Carlos Rosa Remedios

Description

The MultilingualTweetEval2024 dataset consists of two subsets:

  1. General Purpose: anonymized tweets in multiple languages (Chinese, English, Spanish, French, and German), containing only the tweet text.
  2. English Labeled: English tweets that have been auto-labeled for hate speech detection using a custom model.

Both subsets are fully anonymized and are designed to facilitate research in multilingual tweet analysis and hate speech detection in Twitter content.
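As a rough illustration of how the two subsets described above might be read and summarized, the following Python sketch parses CSV-style records and counts rows per language and per label. The file layout is not documented on this page, so the CSV format, the column names (`language`, `text`, `label`), and the helper names `read_rows` and `count_by` are all assumptions made for the example, and the rows are placeholders rather than real dataset content.

```python
import csv
import io
from collections import Counter

# Hypothetical column names: the dataset page does not document the schema.
GENERAL_COLUMNS = ["language", "text"]
LABELED_COLUMNS = ["text", "label"]


def read_rows(handle, expected_columns):
    """Read CSV rows as dicts, checking that the expected columns exist."""
    reader = csv.DictReader(handle)
    missing = set(expected_columns) - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    return list(reader)


def count_by(rows, column):
    """Count how many rows share each value of the given column."""
    return Counter(row[column] for row in rows)


# Placeholder rows for illustration only; not taken from the dataset.
general_csv = io.StringIO(
    "language,text\n"
    "en,placeholder tweet\n"
    "es,placeholder tweet\n"
    "en,placeholder tweet\n"
)
labeled_csv = io.StringIO(
    "text,label\n"
    "placeholder tweet,0\n"
    "placeholder tweet,1\n"
)

general_rows = read_rows(general_csv, GENERAL_COLUMNS)
labeled_rows = read_rows(labeled_csv, LABELED_COLUMNS)

language_counts = count_by(general_rows, "language")
label_counts = count_by(labeled_rows, "label")
```

To use this with the downloaded files, the `io.StringIO` objects would be replaced by opened file handles, and the column lists adjusted to whatever headers the files actually contain.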

Institutions

  • Universidad de La Laguna

Categories

Artificial Intelligence, Social Media, Natural Language Processing, Machine Learning, Speech Analysis, Twitter, Sentiment Analysis

Licence