Genotype data for a set of 163 worldwide populations

Published: 26 Jan 2016 | Version 1 | DOI: 10.17632/ckz9mtgrjj.1

Description of this data

This is a combined dataset of genotype data for 2,643 individuals from 163 worldwide human populations. All genotypes were generated on Illumina chips (550, 610, 660) across multiple studies. The dataset was compiled mainly for two papers: Hellenthal et al. (2014), A Genetic Atlas of Human Admixture History, Science; and Busby et al. (2015), The role of recent admixture in forming the contemporary West Eurasian genomic landscape, Current Biology.

The data are in PLINK format, and the BusbyWorldwidePopulations.csv file outlines where the different datasets come from. Note that because these two datasets were combined, not all populations are typed on the same set of SNPs. We have included genotype data on 523,443 SNPs, of which 441,038 are genotyped in at least 97.5% of individuals.

Additional QC steps are therefore required to filter this set down to high-quality calls, and the appropriate filters depend on which subset of samples is used. Complete information about the populations is available in the publications listed in the associated paper.
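Because the set of SNPs that pass a call-rate threshold depends on which individuals are kept, a small sketch of that dependence may help. The example below is illustrative only: it works on a toy in-memory genotype matrix rather than on the PLINK files themselves, and the names `snp_call_rates`, `filter_snps`, and `toy` are hypothetical, not part of this dataset or of PLINK. The default threshold of 0.975 is taken from the description above; whether it suits a given analysis is an assumption to check.

```python
from typing import List, Optional, Sequence

# One genotype call: 0, 1 or 2 copies of an allele, or None if the call is missing.
Genotype = Optional[int]


def snp_call_rates(genotypes: Sequence[Sequence[Genotype]]) -> List[float]:
    """Return the fraction of non-missing calls for each SNP.

    genotypes[i][j] is the call for individual i at SNP j.
    """
    if not genotypes:
        return []
    n_individuals = len(genotypes)
    n_snps = len(genotypes[0])
    rates = []
    for j in range(n_snps):
        called = sum(1 for row in genotypes if row[j] is not None)
        rates.append(called / n_individuals)
    return rates


def filter_snps(
    genotypes: Sequence[Sequence[Genotype]],
    keep_individuals: Sequence[int],
    min_call_rate: float = 0.975,
) -> List[int]:
    """Return indices of SNPs whose call rate, computed only over the
    selected individuals, is at least min_call_rate."""
    subset = [genotypes[i] for i in keep_individuals]
    rates = snp_call_rates(subset)
    return [j for j, rate in enumerate(rates) if rate >= min_call_rate]


# Toy data (hypothetical): 4 individuals x 3 SNPs, None marks a missing call.
toy = [
    [0, 1, None],
    [2, None, None],
    [1, 1, 0],
    [0, 2, 1],
]

# With all four individuals, the third SNP is missing in half of them.
all_samples = filter_snps(toy, [0, 1, 2, 3], min_call_rate=0.75)

# Restricting to the last two individuals, every SNP is fully called.
subset_only = filter_snps(toy, [2, 3])
```

In this toy case the third SNP fails when all individuals are included but passes once the sample subset is restricted, which is why the filtering has to be redone per subset. On the real files, PLINK's own `--geno` option applies a per-SNP missingness threshold of this kind.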

Note that these same populations are available elsewhere; this dataset is the version compiled for the papers mentioned above.



Previous versions

  • Version 2 (unavailable)


  • Version 1


    Published: 2016-01-26

    DOI: 10.17632/ckz9mtgrjj.1

    Cite this dataset

    Busby, George (2016), “Genotype data for a set of 163 worldwide populations”, Mendeley Data, v1


Natural Sciences


CC BY 4.0

The files associated with this dataset are licensed under a Creative Commons Attribution 4.0 International licence.

What does this mean?
You can share, copy and modify this dataset so long as you give appropriate credit, provide a link to the CC BY license, and indicate if changes were made, but you may not do so in a way that suggests the rights holder has endorsed you or your use of the dataset. Note that further permission may be required for any content within the dataset that is identified as belonging to a third party.