Synthetic Datasets for Scenario-based Data Breaches

Published: 22 October 2024 | Version 1 | DOI: 10.17632/sxfjgcynjv.1
Contributors:

Description

We have generated synthetic datasets for scenario-based data breaches. There are two kinds of datasets: (1) a Master Record Table (MRT) of 4 million records, each profiling an individual with several items of personally identifiable information (PII), generated programmatically; and (2) 16 scenario-based datasets depicting various fictitious data breaches, with varying numbers of records and with PIIs distributed across them for variability. We have also included the code so that practitioners, researchers, and others can reuse it. This supports transparency through reusability, reproducibility, and replicability.
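
As a rough illustration of how a master table and a breach scenario can relate, the sketch below builds a small table of fictitious profiles and samples a scenario that exposes only some PII fields. It is not the code distributed with this dataset. The field names, the helper functions `make_master_records` and `make_breach_scenario`, and the idea that scenarios are sampled from the master table are all assumptions made for illustration.

```python
import random

# Hypothetical PII field names; the dataset's actual schema may differ.
PII_FIELDS = ["name", "email", "phone", "date_of_birth", "address"]


def make_master_records(n, seed=0):
    """Build n fictitious profiles with placeholder PII values."""
    rng = random.Random(seed)
    records = []
    for i in range(n):
        year = rng.randint(1950, 2005)
        month = rng.randint(1, 12)
        day = rng.randint(1, 28)
        records.append({
            "id": i,
            "name": f"Person {i}",
            "email": f"person{i}@example.com",
            "phone": f"555-{rng.randrange(10_000):04d}",
            "date_of_birth": f"{year}-{month:02d}-{day:02d}",
            "address": f"{rng.randint(1, 999)} Example Street",
        })
    return records


def make_breach_scenario(master, n_records, fields, seed=0):
    """Sample n_records individuals and keep only the exposed PII fields."""
    rng = random.Random(seed)
    sampled = rng.sample(master, n_records)  # sampling without replacement
    return [{"id": r["id"], **{f: r[f] for f in fields}} for r in sampled]


# A small master table and one scenario exposing only email and phone.
master = make_master_records(1000)
scenario = make_breach_scenario(master, 50, ["email", "phone"], seed=1)
```

Fixing the seed makes each scenario reproducible, and changing `n_records` and `fields` per scenario gives the kind of variation in record counts and PIIs described above.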

Files

Categories

Cybersecurity, Privacy

Licence