DLAVP Dataset

Published: 6 June 2023 | Version 1 | DOI: 10.17632/ns7vrm6h8j.1
Contributor:
Ruifen CAO

Description

2762 antiviral peptides (AVPs) and 10095 non-AVPs.

Files

Steps to reproduce

To verify the generalization of the model, we constructed a new dataset containing all the new data we could find. As antiviral peptide (AVP) research has developed, more AVP databases and new data have become available beyond the earlier databases VIRsiRNAdb [17], CAMP [18], and APD2 [19]. Pang et al. [20] also assembled a new AVP collection in their study of antiviral peptides. They collected antiviral peptides from the dbAMP [21], AVPdb [22], DRAMP [23], DBAASP [24], and HIPdb [25] databases, with sequence lengths ranging from 8 to 150 standard amino acids. Their negative data came mainly from the dbAMP [21], DBAASP [24], DRAMP [23], and UniProt [26] databases. We supplemented Pang's dataset by collecting AVPs from the CAMPR3 [27], LAMP, APD3, and DRAMP [23] databases, finally obtaining 2762 AVPs and 10095 non-AVPs. To balance the positive and negative samples, 2762 sequences were randomly selected from the non-AVP dataset. These data serve as the new dataset in this study.
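
The length filter and the balancing step described above can be sketched in Python as follows. This is a minimal illustration, not the contributors' actual script: the function names `is_valid_peptide` and `balance_negatives`, the fixed random seed, and the assumption that sequences are plain one-letter amino acid strings are all hypothetical choices made for the example.

```python
import random

# The 20 standard amino acids in one-letter code.
STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")


def is_valid_peptide(seq, min_len=8, max_len=150):
    """Return True if seq has 8 to 150 residues, all of them standard amino acids."""
    return min_len <= len(seq) <= max_len and set(seq) <= STANDARD_AA


def balance_negatives(positives, negatives, seed=0):
    """Randomly draw as many negatives as there are positives, without replacement.

    The seed is an assumption added for reproducibility; the dataset
    description does not state how the random selection was seeded.
    """
    if len(negatives) < len(positives):
        raise ValueError("not enough negative sequences to balance the classes")
    rng = random.Random(seed)
    return rng.sample(negatives, len(positives))
```

Applied to the full data, `balance_negatives(avps, non_avps)` would return 2762 of the 10095 non-AVP sequences, giving a 1:1 positive-to-negative ratio. Sampling without replacement ensures that no negative sequence appears twice in the balanced set.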

Institutions

Anhui University

Categories

Virus Peptides

Licence