Synthetic Datasets for Scenario-based Data Breaches
Published: 22 October 2024| Version 1 | DOI: 10.17632/sxfjgcynjv.1
Contributors:
, Description
We have synthetically generated synthetic datasets for scenario based data breaches. There are two kinds of datasets: 1. Master Record Table (MRT) which consists of 4 million records of the individuals profiled with several PIIs which is synthetically generated programatically; 2. 16 Scenario based datasets depicting various fictitious data breaches with varying number of records and PIIs are also distributed across for the variability. Furthermore we have also included the code such that practitioners, researchers and others can use the code further. This enables transparency in the form of reusability, reproducibility, and replicability.
Files
Categories
Cybersecurity, Privacy