Composite Protein Database (nr) from Cephalopod Salivary Apparatus for In silico Enzymatic Digestion and Peptide Library Generation
The database includes proteins and translated transcriptomes from the posterior salivary glands (PSG) of Octopus vulgaris and 16 other cephalopods. It is intended for generating peptide libraries with various in silico enzymatic digestion protocols, in order to explore peptide diversity.
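As an illustration of what an in silico digestion protocol does, the following is a minimal sketch of a rule-based digester. The function name `digest`, its parameters, and the example sequence are hypothetical; the default rule (cleave after K or R unless followed by P) is the commonly used trypsin rule and is only one of the possible protocols, not necessarily the one applied to this database.

```python
def digest(sequence, cleave_after="KR", block_before="P",
           missed_cleavages=0, min_length=1):
    """Return peptides from a rule-based in silico digestion.

    Cleaves C-terminal to any residue in `cleave_after`, unless the next
    residue is in `block_before`. Defaults approximate trypsin (assumption).
    """
    seq = sequence.upper()
    # Positions where the chain is cut (index of the first residue after the cut).
    sites = [
        i + 1
        for i in range(len(seq) - 1)
        if seq[i] in cleave_after and seq[i + 1] not in block_before
    ]
    bounds = [0] + sites + [len(seq)]

    peptides = []
    for i in range(len(bounds) - 1):
        # Join up to `missed_cleavages` additional adjacent fragments.
        last = min(i + 2 + missed_cleavages, len(bounds))
        for j in range(i + 1, last):
            peptide = seq[bounds[i]:bounds[j]]
            if len(peptide) >= min_length:
                peptides.append(peptide)
    return peptides


if __name__ == "__main__":
    # Hypothetical toy sequence, not taken from the database.
    print(digest("MKRPAKLLR"))
    print(digest("MKRPAKLLR", missed_cleavages=1))
```

Other enzymes could be approximated by changing `cleave_after` and `block_before`; enzymes that cleave N-terminal to a residue would need a different rule than this sketch provides.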
Steps to reproduce
1. The previous omics datasets (Databases A, C, D, E, and F), published at DOI: 10.3390/data5040110, underwent duplicate removal during the cleaning process.
2. They were then concatenated, and redundancy was reduced at 98% sequence identity using CD-HIT.
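The two steps above can be sketched as follows. This is a rough outline under stated assumptions: duplicates are treated here as records with identical sequences (the original cleaning criteria are not specified), the helper names and file names are hypothetical, and only the CD-HIT options `-i`, `-o`, and `-c` are set, with all other parameters left at their defaults because the source does not state them.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples."""
    records, header, chunks = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def drop_exact_duplicates(records):
    """Keep the first record for each distinct sequence (assumed criterion)."""
    seen, unique = set(), []
    for header, seq in records:
        key = seq.upper()
        if key not in seen:
            seen.add(key)
            unique.append((header, seq))
    return unique


def cd_hit_command(input_fasta, output_fasta, identity=0.98):
    """Build a CD-HIT call clustering at the given sequence identity."""
    return ["cd-hit", "-i", input_fasta, "-o", output_fasta, "-c", str(identity)]


if __name__ == "__main__":
    # Hypothetical toy input standing in for the concatenated databases.
    toy = ">a\nMKRPAK\n>b\nMKRPAK\n>c\nLLRGG\n"
    print(drop_exact_duplicates(parse_fasta(toy)))
    # File names are placeholders; pass the list to subprocess.run to execute.
    print(" ".join(cd_hit_command("combined.fasta", "combined_nr98.fasta")))
```

Building the command as a list keeps it inspectable before execution; running it requires CD-HIT to be installed and available on the system path.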