LdHU3_protein coding sequences

Published: 3 March 2025| Version 2 | DOI: 10.17632/wnrm2hk8gs.2
Contributors:
Jose M. Requena,

Description

This dataset contains sequences (fasta file) and genomic coordinates (gff file) for the protein-coding (CDS) sequences annotated in the Leishmania donovani (HU3 strain) genome. These data are based on the initial annotations carried out by Camacho et al (2019), with posterior improvements included after the studies by Sánchez-Salvador et al (2023) and Adán-Jiménez et al. (2024). The archive gff should be explored using a genome visualizer like IGV (https://igv.org/) and the L. donovani (HU3) genome (Mendeley data: LdHU3_Genome sequence; https://data.mendeley.com/datasets/b82fm2w2h9/2) REFERENCES - Camacho, E., Gonzalez-de la Fuente, S., Rastrojo, A., Peiro-Pastor, R., Solana, J.C., Tabera, L., Gamarro, F., Carrasco-Ramiro, F., Requena, J.M., and Aguado, B. (2019). Complete assembly of the Leishmania donovani (HU3 strain) genome and transcriptome annotation. Sci Rep 9, 6127. PMID: 30992521. - Sánchez-Salvador, A., González-de la Fuente, S., Aguado, B., Yates, P.A., and Requena, J.M. (2023). Refinement of Leishmania donovani Genome Annotations in the Light of Ribosome-Protected mRNAs Fragments (Ribo-Seq Data). Genes (Basel) 14, 1637. https://pubmed.ncbi.nlm.nih.gov/37628688 - Adán-Jiménez, J., Sánchez-Salvador, A., Morato, E., Solana, J. C., Aguado, B. and Requena, J. M. (2024). A Proteogenomic Approach to Unravel New Proteins Encoded in the Leishmania donovani (HU3) Genome. Genes 15, 775. https://pubmed.ncbi.nlm.nih.gov/38927711

Files

Steps to reproduce

See description (above)

Institutions

Universidad Autonoma de Madrid

Categories

Genome, Protein Annotation, Leishmania, Coding (DNA)

Funding

Agencia Estatal de Investigación

PID2020-117916RB-I00/AEI/10.13039/501100011033

Licence