MVRSION validation data
MVRSION is a method that identifies and quantifies members of bacterial communities through simultaneous analysis of multiple variable regions of the bacterial 16S rRNA gene. The MVRSION method combines multiplexed PCR with high-throughput microfluidics, Illumina sequencing and a novel bioinformatic algorithm. Included in this archive is the data used to validate the microbial taxonomic calls made by MVRSION. The file validation_knowns.txt contains the known relative abundance of each bacterial species per sample. The samples named BEI and 48G are mixtures of bacterial DNA where the relative abundances are known exactly. The rest of the samples were collected from stool samples of gnotobiotic mice where the relative abundances were estimated using COPRO-seq. DNA from each of the 97 validation samples was amplified with a panel of 14 PCR primer pairs targeting all nine hypervariable regions of the 16S gene using a Fluidigm Access Array. The primer sequences are available in the file mvrsion_16s_primers. The sequencing libraries generated by the Access Array were sequenced on two 2x150 base pair MiSeq runs. The file fastq_files.tar contains the raw fastqs included in the validation from the two MiSeq runs.