Dataset: How Different Genders Use Profanity on Twitter?

Name: Dataset: How Different Genders Use Profanity on Twitter?
Creator: Phoey Lee Teh
Published: 2021-06-15T14:18:17.654Z
Keywords: Social Media, Speech Analysis, Gender, Profile Analysis, Twitter, Textual Analysis

Teh, Phoey Lee

doi:10.17632/94s687fvdp.1

Dataset: How Different Genders Use Profanity on Twitter?

Published: 15 June 2021| Version 1 | DOI: 10.17632/94s687fvdp.1

Contributor:

Phoey Lee Teh

Description

File 01 - a summary table of keywords used to crawled those tweets in File 02 In this file, we summaries the keywords that used to obtain the 102464 tweets that we refer to the first study from this article- https://dl.acm.org/doi/abs/10.1145/3193077.3193078 File 02 - All the Original tweets that were collected using the keywords from File 01 File 03 - All the processed tweets categories by location File 04 - All the Public and Targeted Male and Female set of Tweets For File 01, the method to obtain the set of keywords is descript in this article https://dl.acm.org/doi/abs/10.1145/3193077.3193078 Phoey Lee Teh, Chi-Bin Cheng, and Weng Mun Chee. 2018. Identifying and Categorising Profane Words in Hate Speech. In Proceedings of the 2nd International Conference on Compute and Data Analysis (ICCDA 2018). Association for Computing Machinery, New York, NY, USA, 65–69. DOI:https://doi.org/10.1145/3193077.3193078 For File 04, the result is published in this article, kindly refer https://dl.acm.org/doi/abs/10.1145/3388142.3388145 Phoey Lee Teh, Chi-Bin Cheng, and Weng Mun Chee. 2018. Identifying and Categorising Profane Words in Hate Speech. In Proceedings of the 2nd International Conference on Compute and Data Analysis (ICCDA 2018). Association for Computing Machinery, New York, NY, USA, 65–69. DOI:https://doi.org/10.1145/3193077.3193078

Files

Steps to reproduce

Step to produce File 01 keywords was explained in the methodology of this article - https://dl.acm.org/doi/abs/10.1145/3193077.3193078 Phoey Lee Teh, Chi-Bin Cheng, and Weng Mun Chee. 2018. Identifying and Categorising Profane Words in Hate Speech. In Proceedings of the 2nd International Conference on Compute and Data Analysis (ICCDA 2018). Association for Computing Machinery, New York, NY, USA, 65–69. DOI:https://doi.org/10.1145/3193077.3193078 Step to produce Original tweets from File 02 into File 03 is descript in the methodology of this article https://dl.acm.org/doi/abs/10.1145/3388142.3388145 Phoey Lee Teh, Chi-Bin Cheng, and Weng Mun Chee. 2018. Identifying and Categorising Profane Words in Hate Speech. In Proceedings of the 2nd International Conference on Compute and Data Analysis (ICCDA 2018). Association for Computing Machinery, New York, NY, USA, 65–69. DOI:https://doi.org/10.1145/3193077.3193078 And File 04 is part of the generated output of Female and Male "Public" and "Target" tweets. Kindly refer to the same article as above to explain the details of what are "Public" and "Target".

Institutions

Sunway University

Dataset: How Different Genders Use Profanity on Twitter?

Description

Files

Steps to reproduce

Institutions

Categories

Licence