Phylogenomic and Evolutionary Insights into Lipoprotein Lipase Genes (lpl) in Tambaqui: Gene Duplication, Tissue-Specific Expression, and Physiological Implications
Description
This dataset includes two tables: (1) Predicted N-glycosylation sites in LPL/Lpla/Lplb sequences across vertebrates, and (2) Predicted N-glycosylation sites in LPL/Lpla/Lplb sequences across vertebrates, including Lpla and Lplb from tambaqui and other teleosts. Any potential crossing the default threshold of 0.5, represents a predicted glycosylated site (as long as it occurs in the required sequon Asn-Xaa-Ser/Thr without Proline at Xaa). Sites with + Potential (< 0.5) are highlighted in yellow, ++ Potential (< 0.5) are highlighted in green and +++ Potential (< 0.75) are highlighted in blue. non-glycosylated sites - Potential (< 0.5) are highlighted in red
Files
Steps to reproduce
Protein sequences of LPL, Lpla, Lplb, LPL2, Lpl2a and Lpl2b from vertebrates were retrieved from [NCBI/Ensembl] and formatted in FASTA format for analysis. The sequences were then submitted to the NetNGlyc 1.0 web server (http://www.cbs.dtu.dk/services/NetNGlyc/) using default parameters, which predict potential N-glycosylation sites based on the consensus motif Asn-X-Ser/Thr (N-X-S/T). The threshold score for glycosylation site prediction was applied according to the tool’s default settings. Sites with a positive prediction score were considered putative N-glycosylation sites, and only those exceeding the high-confidence threshold (as defined by NetNGlyc) were retained for further analysis. The predicted N-glycosylation sites were then compared among LPL, Lpla, and Lplb sequences across vertebrates. Conservation of glycosylation sites was assessed using multiple sequence alignment with MUSCLE. The final dataset was compiled into a table, including species names, taxonomic groups, glycosylation site positions, and prediction scores.