MultilingualTweetEval2024
Published: 30 December 2024| Version 1 | DOI: 10.17632/8x3fzj5t3c.1
Contributors:
, , Description
MultilingualTweetEval2024 The MultilingualTweetEval2024 dataset consists of two subsets: 1.General Purpose: Includes anonymized tweets in multiple languages (Chinese, English, Spanish, French, and German) containing only the text. 2.English Labeled: Contains English tweets that have been auto-labeled using a custom model for hate speech detection. Both datasets are fully anonymized and designed to facilitate research in multilingual tweet analysis and hate speech detection in Twitter content.
Files
Institutions
Universidad de La Laguna
Categories
Artificial Intelligence, Social Media, Natural Language Processing, Machine Learning, Speech Analysis, Twitter, Sentiment Analysis