Antimicrobial Peptides Predicted from Omics Data of Cephalopods' Posterior Salivary Glands
This dataset compiles antimicrobial peptides (AMPs) consistently identified within omics data extracted from Cephalopods' Posterior Salivary Glands (PSG). It comprises consensus AMPs that were predicted across the 13 AMP datasets reported in (doi: 10.17632/wwk7zzcfhv.1). These latter datasets stemmed from agreed AMP predictions by three machine learning models on peptide libraries generated via 13 distinct in silico enzymatic digestion protocols applied to omics data extracted from Cephalopods' PSG (doi: 10.17632/6fjsdnvygb.1). This dataset holds promise as a potential reservoir of novel AMPs, suitable for further exploration and screening to uncover peptide candidates for drug development.
Steps to reproduce
The creation of this dataset involved two key steps: 1. The initial peptide libraries, accessible via doi: 10.17632/wwk7zzcfhv.1, were combined. 2. To eliminate sequence redundancy, CD-HIT was utilized with a sequence identity threshold set at 0.98