Data on genetic diversity of circumsporozoite protein (csp) non-repeat regions from Plasmodium knowlesi clinical isolates of Sabah.

Published: 11 April 2022| Version 1 | DOI: 10.17632/byxfrkf7gp.1
Zarina Amin


This dataset presents an analysis genetic diversity of malaria circumsporozoite protein (csp) of Plasmodium knowlesi in Sabah; where circumsporozoite protein is one of the targeted candidates for malaria vaccine development and was conducted to evaluate the suitability of csp as a vaccine in relation to its genetic diversity. The data were collected from 26 human blood spot samples from Kudat and Kota Kinabalu hospitals in Sabah in 2012 which were tested positive for malaria. Genomic DNA extraction, nested PCR, cloning and sequencing of the csp genes were carried out and phylogenetic, sequence diversity and natural selection of the csp genes were analysed using bioinformatic tools such as MEGAX and DnaSP ver. 5.10.00 for phylogenetic tree build, mutational analysis and neutral theory of evolution. Analysis and comparison of this gene was done against P. knowlesi csp strain H as a reference sequence (GenBank database XM_002258966.1) showed point mutations at 52 positions among the 237 sequences of different geographical regions. The phylogenetic tree revealed that the occurrence of multiple haplotypes was scattered despite of geographical location. The evolutionary history which was inferred using the Neighbor-Joining method revealed no geographical clustering to any country listed above; with a total of 76 non-repeat region Pkcsp haplotypes including one unique haplotype (haplotype H12). These data could serve as auxiliary information and/or research data for other researchers in Sabah. It could also serve as guide or reference data to other researchers outside Sabah who may be interested in carrying out similar research in other states.


Steps to reproduce

Data was acquired via Nested PCR of genomic DNA from blood spots of 26 human blood samples. Molecular cloning and sequencing were then carried out of the samples followed by csp gene sequence alignment and finally analysis of the csp genes were carried out for phylogeny, sequence diversity and natural selection via Bioinformatics tools which included MEGAX’s Neighbour Joinign Technique and DnaSP ver 5.10.00.


Universiti Malaysia Sabah


Infectious Disease