Distribution and phasing of sequence motifs that facilitate CRISPR adaptation. Santiago-Frangos et al
Description
Data S1. Position weight matrices of subtype I-C, I-E, I-F and II-C CRISPR leader motifs and I-F CRISPR repeat, Related to STAR methods, Table 1 and Figures 2, 3, 4, S1, S2, S5. (A) I-EA motif. (B) I-EB motif. (C) I-E IHF binding site. (D) I-E leader anchoring motif. (E) I-E Upstream motif. (F) I-F IHF binding site. (G) I-F CRISPR repeat. (H) I-F Upstream motif. (I) I-C Upstream motif. (J) II-C Upstream motif. Data S2. Summary of significant matches to motifs found in CRISPR leaders, Related to STAR methods, Table 1 and Figures 2, 3, S1, S5. (A) Significant matches to position weight matrices in all CRISPR leaders analyzed. (B) Tally of the Phyla that I-E, I-F, I-C and II-C CRISPRs originate from and relation to leader motifs. (C) IHF-regulated CRISPR loci predicted to rapidly acquire new spacers. (D) Summary of leader motifs found in Campylobacter and Neisseria spp. II-C CRISPR loci. Data S3. Analysis of I-E leader motifs found adjacent to non-canonical integration sites in vivo in E. coli, Related to STAR methods and Figures 3 and S5. (A) Identification of I-E leader motifs and repeat motifs in sequences flanking both sides of non-canonical integration sites. (B) Summary of top 25 unique integration sites across 4 biological replicates reported in Nivala et al., 2018.