Accompanying dataset to the study "Natural CRISPR systems and targets in the human microbiome"

Published: 26 August 2020| Version 1 | DOI: 10.17632/bsmmy8pwrt.1
Contributors:
,
,
,
,

Description

This repository hosts data products resulting from the CRISPR and cas gene study of the Human Microbiome Project (HMP1-II) population, consisting of spacer and repeat sequences (hmp1-II-crispr-spacers, hmp1-II-crispr-repeats) recovered from shotgun metagenomes using a read-based approach (Crass, Skennerton et al, 2013). The functional and taxonomic spacer annotations were obtained by aligning the spacer content to each sample’s metagenome and corresponding UniRef90 collection (hmp1-II-crispr-spacers-annotation). We further provide the UniRef90 profiling of the 2,355 metagenomes of the HMP1-II collection that was used as the basis of the Cas1-Cas12 analysis (Files hmp1-II_uniref90 and hmp1-II_uniref90_cas). We used a custom HUMAnN 2 database to subset relevant protein families (humann2_cas_mapping).

Files

Categories

Metagenomics, CRISPR

Licence