Datasets Comparison
Version 1
Elsagate Corpus
Description
Identifying disturbing online content being targeted at children is an important content moderation problem. However, previous approaches to this problem have focused on features of the content itself, and neglected potentially helpful insights from the reactions expressed by its online audience. To help remedy this, we present the Elsagate Corpus, a collection of over 22 million comments on more than 18,000 videos that have been associated with disturbing content.
Institutions
University of Bristol
Categories
Cybersecurity, Computational Linguistics, Natural Language Processing
Licence
Creative Commons Attribution 4.0 International
Version 2
Elsagate Corpus
Description
Identifying disturbing online content being targeted at children is an important content moderation problem. However, previous approaches to this problem have focused on features of the content itself, and neglected potentially helpful insights from the reactions expressed by its online audience. To help remedy this, we present the Elsagate Corpus, a collection of over 22 million comments on more than 18,000 videos that have been associated with disturbing content.
Institutions
University of Bristol
Categories
Cybersecurity, Computational Linguistics, Natural Language Processing
Licence
Creative Commons Attribution 4.0 International