Skip to main content
Exit comparison
Removed
Added

Datasets Comparison

Version 1

Elsagate Corpus

Published:28 June 2024|Version 1|DOI:10.17632/pgcp2r75zf.1
Contributor:Panagiotis Soustas

Description

Identifying disturbing online content being targeted at children is an important content moderation problem. However, previous approaches to this problem have focused on features of the content itself, and neglected potentially helpful insights from the reactions expressed by its online audience. To help remedy this, we present the Elsagate Corpus, a collection of over 22 million comments on more than 18,000 videos that have been associated with disturbing content.

Institutions

University of Bristol

Categories

Cybersecurity, Computational Linguistics, Natural Language Processing

Licence

Creative Commons Attribution 4.0 International

Version 2

Elsagate Corpus

Published:20 November 2024|Version 2|DOI:10.17632/pgcp2r75zf.2
Contributor:Panagiotis Soustas

Description

Identifying disturbing online content being targeted at children is an important content moderation problem. However, previous approaches to this problem have focused on features of the content itself, and neglected potentially helpful insights from the reactions expressed by its online audience. To help remedy this, we present the Elsagate Corpus, a collection of over 22 million comments on more than 18,000 videos that have been associated with disturbing content.

Institutions

University of Bristol

Categories

Cybersecurity, Computational Linguistics, Natural Language Processing

Licence

Creative Commons Attribution 4.0 International