Dataset for metagenomic analysis of bacterial community grown in sediments contaminated with pulp paper mill waste pollutants
Pulp and paper mills (PPMs) generate complex and recalcitrant chlorolignin pollutants in their waste stream, posing a significant environmental and health threat. To understand how microbial communities can help in the clean-up of chlorolignin waste discharged from PPMs, two sediment samples, i.e., PPS-1 and PPS-2, were collected from two sites (site-1 and site-2) to examine chemical contaminants and their effects on indigenous bacterial communities. Physicochemical and GC-MS analysis of sediment samples revealed the presence of heavy and various organic refractory and toxic pollutants. Simultaneously, high-throughput sequencing of the V3–V4 hypervariable region of the 16S rRNA genes of metagenome extracted from sediment samples, was conducted on the Illumina MiSeq platform. This dataset consists of raw sequencing data (fastq) analysis reports, pie charts, and barplots. In addition, it also consists of the analyzed rarefaction curve, heat map, and Krona chart. Taxonomic analysis of metagenomic sequence data revealed that Proteobacteria dominated both PPS-1 and PPS-2 at the phylum level. At the genus level, Thiobacillus was the most prevalent in sediment samples. These results highlight substantial differences in bacterial community diversity and taxonomic abundance between PPS-1 and PPS-2. This research offers valuable insights into the characteristics of contaminants, shifts in microbial communities, and their potential functions in chlorolignin waste-polluted sites.