ROI: A method for identifying organizations receiving personal data

Published: 3 August 2023| Version 1 | DOI: 10.17632/3mdyg53c94.1
Contributors:
David Rodríguez,
,
,

Description

Article Contributions This file enumerates and explains the distribution of the datasets in the files. Privacy Policies dataset This dataset ["Policies_urls.csv"] contains 142 privacy policy URLs with the corresponding organization. These URLs were obtained with the two methods (Selenium & Google) described in the article. This is the reason for duplicated URLs. 300 Domain Holders This dataset ["300_domain_holders.xlsx"] contains three different sheets for each of the datasets used for the validations described in the article i.e. Fortune 500, PII_receivers_1 (for the technique's evaluation) and PII_receivers_2 (for ROI's evaluation). Recipient Domains this dataset ["Domains_receiving_PII.csv"] contains the 40,493 dataflows corresponding to the 1,112 unique domains receiving personal data from Android apps.

Files

Institutions

Universidad Politecnica de Madrid Escuela Tecnica Superior de Ingenieros de Telecomunicacion

Categories

Computer Science, Information Science, Computer Security and Privacy

Funding

Agencia Estatal de Investigación

MCIN/AEI/10.13039/501100011033

Licence