Open Reading Frames (ORFs) and annotations from the paired-end transcriptome outputs of Savalia savaglia's RNASeq data

Published: 10 October 2024 | Version 4 | DOI: 10.17632/ycrmntp78w.4
Contributors:
Dany Domínguez Pérez

Description

This dataset contains Open Reading Frames (ORFs) and annotations obtained from the paired-end transcriptome outputs of Savalia savaglia's RNASeq data. The dataset includes the following files:

From TransDecoder analyses:
• Assembly_Ss_PE.Trinity.fasta.transdecoder.cds: Nucleotide sequences for coding regions of the final candidate ORFs, obtained with TransDecoder from paired-end transcriptome outputs of Savalia savaglia's RNA-Seq.
• Assembly_Ss_PE.Trinity.fasta.transdecoder.gff3: Positions within the target transcripts of the final selected ORFs, obtained with TransDecoder from paired-end transcriptome outputs of Savalia savaglia's RNA-Seq.
• Assembly_Ss_PE.Trinity.fasta.transdecoder.pep: Peptide sequences for the final candidate ORFs, obtained with TransDecoder from paired-end transcriptome outputs of Savalia savaglia's RNA-Seq.
• Assembly_Ss_PE.Trinity.fasta.transdecoder.bed: BED-formatted file describing ORF positions, suitable for viewing using GenomeView or IGV, obtained from paired-end transcriptome outputs of Savalia savaglia's RNA-Seq data.
• blastp.outfmt6.w_pct_hit_length: File providing percentages of hit lengths from BLASTp results of the paired-end de novo assembly of Savalia savaglia. It includes the top hit's length and the percentage of the length covered in the alignment.
• pfam.domtblout: PFAM domain annotations for the predicted proteins in the paired-end de novo assembly of Savalia savaglia.

From Trinotate analyses:
• myTrinotate_PE_Ss.tsv: Comprehensive annotation file with results from Trinotate, including protein domain identification and other annotations from the paired-end de novo assembly of Savalia savaglia.
• Trinotate_PE_Ss_report.cXp_summary.html: HTML report summarizing the annotation results from Trinotate, providing an overview of the functional annotations and transcript features of the paired-end de novo assembly of Savalia savaglia.
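
As a starting point for working with the files listed above, the following Python sketch reads FASTA records (as in the .cds and .pep files) and BLAST tabular rows (as in blastp.outfmt6.w_pct_hit_length). It is an illustration under stated assumptions, not the authors' code. The function names read_fasta and read_outfmt6 are hypothetical. The sketch assumes the BLASTp file starts with the 12 standard "-outfmt 6" columns and that the hit-length columns follow them; since that layout is not documented here, the extra columns are returned unparsed.

```python
import io
from typing import Dict, Iterable, Iterator, List, Tuple

# The 12 standard columns of BLAST tabular output (-outfmt 6).
OUTFMT6_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (record_id, sequence) pairs from FASTA-formatted lines."""
    record_id = None
    chunks: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            if record_id is not None:
                yield record_id, "".join(chunks)
            # The record ID is the first whitespace-delimited header token.
            parts = line[1:].split()
            record_id = parts[0] if parts else ""
            chunks = []
        else:
            chunks.append(line)
    if record_id is not None:
        yield record_id, "".join(chunks)


def read_outfmt6(lines: Iterable[str]) -> Iterator[Dict[str, object]]:
    """Yield one dict per hit, keyed by the standard outfmt 6 column names.

    Columns beyond the first 12 are kept as raw strings under "extra",
    because their exact meaning is an assumption in this sketch.
    """
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment/header lines
        fields = line.split("\t")
        if len(fields) < len(OUTFMT6_COLUMNS):
            raise ValueError(f"expected at least 12 columns, got {len(fields)}")
        row: Dict[str, object] = dict(zip(OUTFMT6_COLUMNS, fields[:12]))
        for key in ("pident", "evalue", "bitscore"):
            row[key] = float(fields[OUTFMT6_COLUMNS.index(key)])
        row["extra"] = fields[12:]
        yield row


if __name__ == "__main__":
    # Toy, made-up records purely to show the call pattern.
    toy_fasta = io.StringIO(">orf1.p1 example header\nMKT\nAAL*\n")
    for rec_id, seq in read_fasta(toy_fasta):
        print(rec_id, len(seq))
```

With the dataset downloaded, the same functions could be applied to an open file handle, for example `open("Assembly_Ss_PE.Trinity.fasta.transdecoder.pep")`, instead of the in-memory toy data.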

Steps to reproduce

ORFs and annotations were generated with TransDecoder v5.7.1 and Trinotate v4.0.2 from the paired-end de novo assembly of the false coral Savalia savaglia.
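
The exact commands and parameters used are not given here. As a rough sketch only, the two usual TransDecoder steps (TransDecoder.LongOrfs, then TransDecoder.Predict) could be assembled as below. The helper name transdecoder_commands is hypothetical. The input file name Assembly_Ss_PE.Trinity.fasta is inferred from the output file names. Whether homology hits were passed to the prediction step via --retain_pfam_hits and --retain_blastp_hits is an assumption, as is the hit file name blastp.outfmt6. The code only builds the command lines and does not run them.

```python
import shlex
from typing import List, Optional


def transdecoder_commands(
    transcripts: str,
    pfam_hits: Optional[str] = None,
    blastp_hits: Optional[str] = None,
) -> List[List[str]]:
    """Return argument lists for the two TransDecoder steps, without running them."""
    # Step 1: extract long candidate ORFs from the assembled transcripts.
    long_orfs = ["TransDecoder.LongOrfs", "-t", transcripts]
    # Step 2: predict the likely coding regions among those candidates.
    predict = ["TransDecoder.Predict", "-t", transcripts]
    # Optional homology evidence (assumed here, not confirmed by the record).
    if pfam_hits:
        predict += ["--retain_pfam_hits", pfam_hits]
    if blastp_hits:
        predict += ["--retain_blastp_hits", blastp_hits]
    return [long_orfs, predict]


if __name__ == "__main__":
    # File names below are assumptions for illustration.
    for cmd in transdecoder_commands(
        "Assembly_Ss_PE.Trinity.fasta",
        pfam_hits="pfam.domtblout",
        blastp_hits="blastp.outfmt6",
    ):
        print(shlex.join(cmd))
```

Keeping command construction separate from execution makes it easy to review the command lines before passing each list to `subprocess.run`.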

Institutions

Stazione Zoologica Anton Dohrn

Categories

Transcriptomics, Protein Annotation, Sequence Analysis

Funding

This work was supported by Centro Ricerche ed Infrastrutture Marine Avanzate in Calabria (CRIMAC) - Fondo FSC 2014-2020 - Piano Stralcio «Ricerca e Innovazione 2015-2017» – Programma Nazionale Infrastrutture di Ricerca (PNIR), CUP C64I20000320001.
