Submitted data includes R code and associated metadata files. Raw data is included.
Contributors:Joseph Donfack, Brian Eckenrode, Traci Carlson, Katherine Jones
Sequences of 15 keratin proteins (i.e., KRT31-36, KRT38-KRT40 and KRT81-84 and KRT86) were downloaded from Uniprot database (version 2016.05.10) and edited to generate a variant keratin protein sequence database. Each variant protein sequence contained a single SAP derived from a nsSNP with a frequency >0.1% in populations of European and African American descent, as listed in the National Center for Biotechnology Information (NCBI) SNP database (http://evs.gs.washington.edu/EVS/).
Contributors:Elaine Y.Y. Cheung, Mayra Eduardoff, Dennis McNevin
Supplementary File S2. R script for selecting the top ancestry-informative markers based on the calculation of population differentiation potential using various metrics.