MultilingualTweetEval2024

Published: 30 December 2024| Version 1 | DOI: 10.17632/8x3fzj5t3c.1
Contributors:
,
,

Description

MultilingualTweetEval2024 The MultilingualTweetEval2024 dataset consists of two subsets: 1.General Purpose: Includes anonymized tweets in multiple languages (Chinese, English, Spanish, French, and German) containing only the text. 2.English Labeled: Contains English tweets that have been auto-labeled using a custom model for hate speech detection. Both datasets are fully anonymized and designed to facilitate research in multilingual tweet analysis and hate speech detection in Twitter content.

Files

Institutions

Universidad de La Laguna

Categories

Artificial Intelligence, Social Media, Natural Language Processing, Machine Learning, Speech Analysis, Twitter, Sentiment Analysis

Licence