Transcriptomic profiles datasets of oil-producing leaves from Dryobalanops aromatica and Dryobalanops lanceolata
The transcriptomic data for both D. aromatica and D. lanceolata was generated using the Illumina HiSeq4000 platform in NovogenAIT, Singapore. The data is beneficial for comparative transcriptome studies to understand more the molecular mechanism differences as to the oil yielding from leaves of the two species which showed the widest genetic distances within the genus.
Steps to reproduce
The leave samples were collected from D. aromatica tree planted in Gunung Dahu Research Forest, Bogor, and D. lanceolata sapling grown in Forest Research and Development Center, Bogor. Sequencing D. aromatica and D. lanceolata leaves were performed using Illumina HiSeq 4000 platform in NovogenAIT, Singapore with paired-end method and examined using FastQC. The data filtration using Fastp v0.19.5 to remove low-quality reads and adaptor with default parameters. De novo assembly process in this study used Trinity v2.9.1 and rnaSPAdes v3.9.0 assembler. The statistical data of the de novo assembly were then calculated using QUAST and assembly completeness was verified using BUSCOs v5.0.0. Based on the assembly result and the completeness of the assemblies, the Trinity assembler was selected for the functional annotation process. The contigs were used to identify ORFs using TransDecoder v5.5.0 and Blast X. Sequence data with the longest ORFs was used in BlastP search, profille collection (hmmscan), signal peptide (SignalP 3.0), and transmembrane domain (TMHMM 2.0). Search for homologous Protein and DNA using Diamond v2.0.8 software with BlastP and BlastX. Methods from the Uniprot daabase (downloaded July 15, 2021). Functional annotation are performed with Trinotate v.3.2.1 to provide information on ORFs data. following the de novo transcritome assembly, annotation, and differential expression analysis protocol.