HumanVSAI_CodeDataset

Published: 30 December 2025 | Version 1 | DOI: 10.17632/kjh95n54f8.1
Contributors:
Ghizlane Boukili, Said Elgarouani, Jamal Riffi

Description

This repository provides a labeled dataset and accompanying machine learning workflows for the binary classification of code snippets by provenance: human-written or AI-generated. The primary objective is to facilitate research on the detection of machine-generated source code.
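
As a rough illustration of the classification task described above, the following sketch trains a tiny token-count Naive Bayes classifier in pure Python. The toy snippets, the label strings "human" and "ai", and all function names are assumptions made for illustration only. They are not taken from the dataset, whose file layout and label encoding are not described here, and the sketch is not the authors' workflow.

```python
import math
import re
from collections import Counter

# Hypothetical toy examples, NOT taken from the dataset. The real file
# layout and label encoding are not stated in the description.
TOY_SAMPLES = [
    ("def add(a, b):\n    return a + b", "human"),
    ("x=1;y=2;print(x+y) # quick hack", "human"),
    ('def add_numbers(first_number: int, second_number: int) -> int:\n'
     '    """Return the sum of two integers."""\n'
     '    return first_number + second_number', "ai"),
    ('def is_even(value: int) -> bool:\n'
     '    """Return True if the value is even."""\n'
     '    return value % 2 == 0', "ai"),
]


def tokenize(code):
    """Split code into identifier-like words and single symbols."""
    return re.findall(r"[A-Za-z_]+|\S", code)


def train(samples):
    """Count tokens per label for a multinomial Naive Bayes model."""
    token_counts = {}
    doc_counts = Counter()
    vocab = set()
    for code, label in samples:
        tokens = tokenize(code)
        token_counts.setdefault(label, Counter()).update(tokens)
        doc_counts[label] += 1
        vocab.update(tokens)
    return token_counts, doc_counts, vocab


def predict(code, model):
    """Return the label with the highest log-posterior score."""
    token_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(doc_counts):
        counts = token_counts[label]
        # Laplace (add-one) smoothing over the shared vocabulary.
        denom = sum(counts.values()) + len(vocab)
        score = math.log(doc_counts[label] / total_docs)
        for tok in tokenize(code):
            if tok in vocab:  # tokens never seen in training are skipped
                score += math.log((counts[tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TOY_SAMPLES)
print(predict("def add(a, b):\n    return a + b", model))
```

With only four toy samples the model merely memorizes surface cues such as docstrings and type hints. A realistic experiment would load the dataset's actual files, hold out a test split, and report accuracy on snippets the model has not seen.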

Files

Institutions

  • Universite Sidi Mohamed Ben Abdallah

Categories

Artificial Intelligence, Machine Learning

Licence