ROI: A method for identifying organizations receiving personal data

Published: 3 August 2023 | Version 1 | DOI: 10.17632/3mdyg53c94.1
Contributors:
David Rodríguez

Description

Article Contributions

This file lists the datasets and explains how they are distributed across the files.

Privacy Policies dataset

This dataset ["Policies_urls.csv"] contains 142 privacy policy URLs, each with its corresponding organization. The URLs were obtained with the two methods (Selenium and Google) described in the article, which is why some URLs are duplicated.

300 Domain Holders

This dataset ["300_domain_holders.xlsx"] contains three sheets, one for each of the datasets used in the validations described in the article: Fortune 500, PII_receivers_1 (for the technique's evaluation) and PII_receivers_2 (for ROI's evaluation).

Recipient Domains

This dataset ["Domains_receiving_PII.csv"] contains the 40,493 dataflows corresponding to the 1,112 unique domains that receive personal data from Android apps.
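The duplicated URLs mentioned for the Privacy Policies dataset can be located with a short script. The following is a minimal Python sketch, not part of the published dataset or article. It assumes the CSV has a header row, and the column names "organization" and "url" are hypothetical, so check them against the actual header of "Policies_urls.csv". The sample rows are made up for illustration.

```python
import csv
import io
from collections import Counter


def duplicated_urls(csv_text, url_column="url"):
    """Return URLs that occur more than once, mapped to their occurrence count.

    `url_column` is an assumed column name; check the actual header of
    Policies_urls.csv before use.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter(row[url_column].strip() for row in reader)
    return {url: n for url, n in counts.items() if n > 1}


# Tiny in-memory sample with made-up rows (not taken from the dataset).
sample = (
    "organization,url\n"
    "ExampleCorp,https://example.com/privacy\n"
    "ExampleCorp,https://example.com/privacy\n"
    "OtherOrg,https://example.org/policy\n"
)
duplicates = duplicated_urls(sample)
print(duplicates)
```

To apply this to the real file, read its contents into a string (for example with open("Policies_urls.csv", encoding="utf-8", newline="").read()) and pass the actual URL column name as url_column.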

Institutions

  • Universidad Politécnica de Madrid, Escuela Técnica Superior de Ingenieros de Telecomunicación

Categories

Computer Science, Information Science, Computer Security and Privacy
