Data files: GenAI Detection Tools, Adversarial Techniques and Implications for Inclusivity in Higher Education

Published: 27 March 2024| Version 2 | DOI: 10.17632/xv6fk2mmh9.2
Contributors:
Mike Perkins,
,
,
,
,
,

Description

This data consists of samples of human and AI generated short form content in the form of MS Word files. AI generated samples are subjected to adversarial techniques to evade content detection by GenAI text detectors. Our accompanying paper 'GenAI Detection Tools, Adversarial Techniques and Implications for Inclusivity in Higher Education' discusses the results of the testing. This data can be used to test how effective AI text detectors are at determining whether a sample is human or machine generated. We identify how the application of adversarial techniques reduces the ability of AI text detectors to accurately determine the source of the sample.

Files

Steps to reproduce

Human generated samples were written by listed authors, machine generated samples were generated and manipulated using GPT-4, Claude 2, and Bard between September and October 2023. Results will vary depending on follow on prompts and versions of Foundation Models used.

Institutions

British University Vietnam

Categories

Artificial Intelligence, Education, Higher Education

Licence