MarLys AMP database - MLAMP_db
Description
The MLAMP_db dataset is a comprehensive collection of antimicrobial peptide (AMP) sequences curated and integrated within the MarLys AMP platform project. To address data fragmentation and redundancy in peptide research, sequences were aggregated and standardized from thirteen primary AMP databases: AMPDB (45,408), dbAMP (34,332), DRAMP (25,523), CAMP (19,181), DBAASP (17,628), SATPdb (14,992), APD (3,129), CyBase (1,692), InverPep (770), DADP (602), CancerPPD (458), BaAMPs (199), and ParaPep (187) [accessed March 2026]. The raw data underwent rigorous cleaning, including the removal of formatting inconsistencies and sequences containing non-standard amino acids, ensuring high-quality records suitable for downstream bioinformatics analyses. The MarLys AMP platform will be accessible at http://bioinformatics.prz.edu.pl/marlys-amp.
Files
Steps to reproduce
All source code, installation guidelines, and detailed instructions required to reproduce the results are available in the GitHub repository at https://github.com/bmcode00/marlys-amp.