MarLys AMP database - MLAMP_db

Published: 9 March 2026 | Version 3 | DOI: 10.17632/w4hb5grjwb.3

Description

The MLAMP_db dataset is a collection of antimicrobial peptide (AMP) sequences curated and integrated within the MarLys AMP platform project. To address data fragmentation and redundancy in peptide research, sequences were aggregated and standardized from thirteen primary AMP databases (accessed March 2026): AMPDB (45,408), dbAMP (34,332), DRAMP (25,523), CAMP (19,181), DBAASP (17,628), SATPdb (14,992), APD (3,129), CyBase (1,692), InverPep (770), DADP (602), CancerPPD (458), BaAMPs (199), and ParaPep (187). The raw data were cleaned by removing formatting inconsistencies and discarding sequences that contain non-standard amino acids, so that the resulting records are suitable for downstream bioinformatics analyses. The MarLys AMP platform will be accessible at http://bioinformatics.prz.edu.pl/marlys-amp.
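The cleaning described above can be illustrated with a minimal Python sketch. It is not the project's actual pipeline (that code is in the GitHub repository listed under "Steps to reproduce"); the function names, the whitespace and case normalization, and the exact-match deduplication are all assumptions made for illustration.

```python
# Illustrative sketch only: the helper names and normalization rules below are
# assumptions, not the MarLys AMP project's actual cleaning code.

# The 20 standard proteinogenic amino acids (one-letter codes).
STANDARD_AA = frozenset("ACDEFGHIKLMNPQRSTVWY")


def normalize(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence.

    Assumption: case carries no meaning. Some sources use lower case for
    D-amino acids; such records would need separate handling.
    """
    return "".join(seq.split()).upper()


def is_standard(seq: str) -> bool:
    """True if the sequence is non-empty and uses only standard residues."""
    return bool(seq) and set(seq) <= STANDARD_AA


def clean_sequences(raw_sequences):
    """Normalize, drop non-standard sequences, and remove exact duplicates.

    The first occurrence of each sequence is kept, in input order.
    """
    seen = set()
    cleaned = []
    for raw in raw_sequences:
        seq = normalize(raw)
        if not is_standard(seq) or seq in seen:
            continue
        seen.add(seq)
        cleaned.append(seq)
    return cleaned
```

For example, given the inputs "GIGKFLHSAKKFGKAFVGEIMNS", "gigkflhsakkfgkafvgeimns " and "KWKLFXKI", this sketch keeps a single copy of the first sequence and drops the last one because it contains the non-standard symbol X.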

Steps to reproduce

All source code, installation guidelines, and detailed instructions required to reproduce the results are available in the GitHub repository at https://github.com/bmcode00/marlys-amp.

Categories

Bioinformatics, Antimicrobial Peptide
