Data for: Diatom genes originating from red and green algae: implications for the secondary endosymbiosis models
A dataset used for establishing whether diatom genes are closer to red algae or green ones. Based on genomes of P. tricornutum, Th. pseudonana, S. acus subsp. radians, F. solaris, F. cylindrus and P.-n. multiseries, as well as all available diatom transcriptomes from MMETSP. The dataset includes ML and UPGMA trees, Likelihood mapping raw data, alignments used for the above and the results of DIAMOND search of diatom predicted proteins against NCBI nr.