Additional Dataset for "Cryptic inoviruses are pervasive in bacteria and archaea across Earth’s biomes"
This dataset includes the following data: Gb_files_inoviruses.zip: GenBank files of all representative genomes for each inovirus species. Ref_PCs_inoviruses.zip: Protein clusters from the references (raw fasta, alignment fasta, hmm profile). iPFs_inoviruses.zip: Protein families from extended inovirus dataset (raw fasta, alignment fasta, hmm profile). MobM_C_primer_amplicon.fasta: Multiple sequence alignment of the C primer products with Methanolobus MobM genome (NZ_FOUJ01000007) confirming that C primer products span the junction of the excised genome.