Original data of chloroplast genomes of 3 Ulmus pumila cultivars ('Xu Ri', 'Yang Gang', and 'Zhao Yang')

Published: 10 March 2026| Version 1 | DOI: 10.17632/w3v49764zv.1
Contributor:
Ning Wang

Description

Research Hypothesis The central hypothesis of this study is that the chloroplast genomes of 3 Ulmus pumila cultivars—' 'Xu Ri', 'Yang Gang', and 'Zhao Yang'—harbor sufficient genetic variation to distinguish them at the molecular level. Data Description The dataset consists of raw Illumina paired-end sequencing data (FASTQ format) from the chloroplast genomes of 3 Ulmus pumila cultivars. Fresh leaf tissue was collected for each cultivar, and total genomic DNA was extracted. Chloroplast DNA was enriched through a combination of whole genome sequencing and subsequent bioinformatic filtering, or via long-range PCR amplification. Sequencing was performed on an Illumina platform (e.g., NovaSeq 6000) with 2 × 150 bp read lengths. Each sample generated approximately 2–4 Gb of raw data, ensuring deep coverage for de novo assembly. The raw FASTQ files are provided per cultivar and contain unprocessed reads with quality scores suitable for downstream analysis. Data Interpretation and Usage The data can be interpreted and utilized in several ways: Cultivar identification: Identified SNPs and SSRs can be developed into molecular markers for rapid and accurate discrimination of U. pumila cultivars, aiding in nursery quality control and intellectual property protection. Breeding and conservation: Insights into genetic diversity and relatedness among cultivars can inform cross-breeding strategies for improved traits and guide conservation of elite germplasm. Comparative genomics: Alignments among cultivars will reveal mutation hotspots and structural variants, contributing to understanding chloroplast genome evolution under domestication and selection. Raw data serves as a valuable resource for testing assembly algorithms and teaching plant organellar genomics. Researchers should process raw reads using standard QC tools (FastQC, Trimmomatic) followed by assembly with GetOrganelle or NOVOPlasty. Assemblies should be validated by read mapping and coverage analysis. The data are shared to promote reproducibility and enable broader comparative studies within Ulmus.

Files

Institutions

Categories

Genome

Licence