Age dataset: A structured general-purpose dataset on life, work, and death of 1.22 million distinguished people

Published: 27 April 2022| Version 1 | DOI: 10.17632/2sfz4tt88g.1
Contributors:
,

Description

We developed a five-step method and inferred birth and death years, binary gender, and occupation from community-submitted data to all language versions of the Wikipedia project. The dataset is the largest on notable deceased people and includes individuals from a variety of social groups, including but not limited to 107k females, 124 non-binary people, and 90k researchers, who are spread across more than 300 contemporary or historical regions. Related paper accepted to the ICWSM Workshop on Data for the Wellbeing of Most Vulnerable.

Files

Steps to reproduce

We developed a five-step method and inferred birth and death years, binary gender, and occupation from community-submitted data to all language versions of the Wikipedia project.

Institutions

Sharif University of Technology, University of Mazandaran

Categories

Demography, Gender Studies, Demography Related to Public Health, History of Demography, Age, Wikis, Health, Gender, Demographics, Country, Determinants of Health

License