Tracking the Global Pulse: The first public Twitter dataset from FIFA World Cup

Published: 14 October 2024| Version 1 | DOI: 10.17632/gw3mcnbkwr.1
Contributors:
,
,
,

Description

The first public large-scale multilingual Twitter dataset related to the FIFA World Cup 2022, comprising over 28 million posts in 69 unique spoken languages, including Arabic, English, Spanish, French, and many others. This dataset aims to facilitate research in future sentiment analysis, cross-linguistic studies, event-based analytics, meme and hate speech detection, fake news detection, and social manipulation detection. File contain two column, "Id": is the id of the post, and "RetweetId": is to the Retweet Id if the post is retweet else equal 0.

Files

Institutions

Universite de Lille, Universite Mohamed Khider de Biskra, Universite de Tebessa Faculte des Sciences Exactes et des Sciences de la Nature et de la Vie

Categories

English, French Language, Multilingualism, Arabic Language, Japanese Language, Spanish Language, Portuguese Language, FIFA World Cup, Twitter

Licence