MultilingualTweetEval2024
Published: 30 December 2024 | Version 1 | DOI: 10.17632/8x3fzj5t3c.1
Contributors:
Carlos Rosa Remedios
Description
The MultilingualTweetEval2024 dataset consists of two subsets:
1. General Purpose: anonymized tweets in multiple languages (Chinese, English, Spanish, French, and German), containing only the tweet text.
2. English Labeled: English tweets that have been auto-labeled using a custom hate speech detection model.
Both subsets are fully anonymized and are designed to facilitate research on multilingual tweet analysis and hate speech detection in Twitter content.
Files
Institutions
- Universidad de La Laguna
Categories
Artificial Intelligence, Social Media, Natural Language Processing, Machine Learning, Speech Analysis, Twitter, Sentiment Analysis