REST API response with or without PII (synthetic & labeled)
Published: 10 March 2025| Version 1 | DOI: 10.17632/2965grswcp.1
Contributor:
Akbar Sahata SakapertanaDescription
Synthetic dataset of request and response from REST API in JSON format. Dataset and Datauji (testing data) comprised of data of random mimicking request and response log of REST API system. Data is generated and labeled by code that emulates several context of businesses such as commerce and health (http://dx.doi.org/10.13140/RG.2.2.17184.90880). The purpose is to be used to train machine learning models to detect whether some REST API endpoints might leaks PII sensitive data and to assess the severity of leaking.
Files
Steps to reproduce
https://github.com/WanMuhafidzFaldi/machine-learning-pii-detection-model
Institutions
Institut Teknologi Sepuluh Nopember
Categories
Cybersecurity, Software Engineering, Privacy-Preserving Technique