REST API response with or without PII (synthetic & labeled)

Published: 10 March 2025| Version 1 | DOI: 10.17632/2965grswcp.1
Contributor:
Akbar Sahata Sakapertana

Description

Synthetic dataset of request and response from REST API in JSON format. Dataset and Datauji (testing data) comprised of data of random mimicking request and response log of REST API system. Data is generated and labeled by code that emulates several context of businesses such as commerce and health (http://dx.doi.org/10.13140/RG.2.2.17184.90880). The purpose is to be used to train machine learning models to detect whether some REST API endpoints might leaks PII sensitive data and to assess the severity of leaking.

Files

Steps to reproduce

https://github.com/WanMuhafidzFaldi/machine-learning-pii-detection-model

Institutions

Institut Teknologi Sepuluh Nopember

Categories

Cybersecurity, Software Engineering, Privacy-Preserving Technique

Licence