Context embeddings trained on short sequence FASTA files for human alternative splicing
Published: 26 April 2023 | Version 2 | DOI: 10.17632/ybz8tcvnv9.2
Contributor:
Daniel Um
Description
Data used to generate "FIGURE 5: Context embeddings trained on short sequence FASTA files for human alternative splicing – n = 10,541" in "Vector Embeddings by Sequence Similarity and Context for Improved Compression, Similarity Search, Clustering, Organization, and Manipulation of cDNA Libraries" by Daniel H. Um et al.
Steps to reproduce
Use Context_Embeddings_3D_Plot_Generation_Script.ipynb on alternative_splicing_human_10541.fasta to generate 3D_plot_FASTA_files_for_human_alt_splicing.png.
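Before running the notebook, it may help to check that the FASTA file parses cleanly and contains the number of records the figure title states (n = 10,541). The following is a minimal sketch in plain Python and is not part of the published notebook; the helper names parse_fasta and summarize_fasta are hypothetical, and the sketch assumes the file is a standard text FASTA with ">" header lines.

```python
from typing import Dict, Iterable, Iterator, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines.

    Hypothetical helper for illustration; not taken from the notebook.
    """
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines may wrap; collect them until the next header.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def summarize_fasta(path: str) -> Dict[str, int]:
    """Return the record count and the shortest/longest sequence length."""
    with open(path, "r", encoding="utf-8") as handle:
        lengths = [len(seq) for _, seq in parse_fasta(handle)]
    return {
        "records": len(lengths),
        "min_length": min(lengths, default=0),
        "max_length": max(lengths, default=0),
    }
```

Calling summarize_fasta("alternative_splicing_human_10541.fasta") from the directory holding the downloaded file should report 10541 records if the file matches the n given in the figure title; a different count would suggest an incomplete download or a different file version.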
Institutions
Columbia University
Categories
Bioinformatics, Machine Learning