Elsagate Corpus

Published: 28 June 2024| Version 1 | DOI: 10.17632/pgcp2r75zf.1
Contributor:
Panagiotis Soustas

Description

Identifying disturbing online content being targeted at children is an important content moderation problem. However, previous approaches to this problem have focused on features of the content itself, and neglected potentially helpful insights from the reactions expressed by its online audience. To help remedy this, we present the Elsagate Corpus, a collection of over 22 million comments on more than 18,000 videos that have been associated with disturbing content.

Files

Institutions

University of Bristol

Categories

Cybersecurity, Computational Linguistics, Natural Language Processing

Licence