Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 4:47:109029.
doi: 10.1016/j.dib.2023.109029. eCollection 2023 Apr.

Complete chloroplast genome data of Shorea macrophylla (Engkabang): Structural features, comparative and phylogenetic analysis

Affiliations

Complete chloroplast genome data of Shorea macrophylla (Engkabang): Structural features, comparative and phylogenetic analysis

Ivy Yee Yen Chew et al. Data Brief. .

Abstract

Shorea macrophylla belongs to the Shorea genus under the Dipterocarpaceae family. It is a woody tree that grows in the rainforest in Southeast Asia. The complete chloroplast (cp) genome sequence of S. macrophylla is reported here. The genomic size of S. macrophylla is 150,778 bp and it possesses a circular structure with conserved constitute regions of large single copy (LSC, 83,681 bp) and small single copy (SSC, 19,813 bp) regions, as well as a pair of inverted repeats with a length of 23,642 bp. It has 112 unique genes, including 78 protein-coding genes, 30 tRNA genes, and four rRNA genes. The genome exhibits a similar GC content, gene order, structure, and codon usage when compared to previously reported chloroplast genomes from other plant species. The chloroplast genome of S. macrophylla contained 262 SSRs, the most prevalent of which was A/T, followed by AAT/ATT. Furthermore, the sequences contain 43 long repeat sequences, practically most of them are forward or palindrome type long repeats. The genome structure of S. macrophylla was compared to the genomic structures of closely related species from the same family, and eight mutational hotspots were discovered. The phylogenetic analysis demonstrated a close relationship between Shorea and Parashorea species, indicating that Shorea is not monophyletic. The complete chloroplast genome sequence analysis of S. macrophylla reported in this paper will contribute to further studies in molecular identification, genetic diversity, and phylogenetic research.

Keywords: Chloroplast genome; Dipterocarpaceae; Monophyletic; Phylogenetic analysis; Shorea macrophylla.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig 1
Fig. 1
Gene map of S. macrophylla chloroplast genome. The genes inside and outside the outer circle are transcribed in clockwise and anticlockwise directions. Genes belonging to different functional groups are coloured with different colours. The inner circle indicates the four regions of the chloroplast genome. IRa, inverted repeat region A; IRb, inverted repeat region B; LSC, large single-copy region; SSC, small single-copy region. The line chart in grey colour shows GC content along the genome.
Fig 2
Fig. 2
Comparison of the borders of the LSC, SSC and IR regions among the eight chloroplast genomes.
Fig 3
Fig. 3
Nucleotide diversity analysis of the whole chloroplast genome. Window length: 600 bp; Step size: 200 bp. X-axis: midpoint positions of a window; Y-axis: nucleotide diversity between S. macrophylla and five other Shorea and two Parashorea species.
Fig 4
Fig. 4
The shuffle-LAGAN program analyzed the comparative analysis of S. macrophylla with Shorea and Parashorea species. The percentage of identity is shown on the vertical axis, which ranges from 50 to 100%, while the horizontal axis represents the position in the chloroplast genome. Each arrow indicates the annotated gene in the reference genome and the direction of its transcription. Genomic regions are color-coded into exons(purple), UTR (neon blue) and CNS (pink).
Fig 5
Fig. 5
Phylogenetic relationships of S. macrophylla with other Dipterocarpaceae family species and two outgroups based on their protein-coding genes. The bootstrap value based on 1000 replicates is shown on each node. The subclades are drawn with Shorea sp. subclade drawn in red colour and Parashorea sp. subclade drawn in green colour. The result shown Shorea genus is not monophyletic.

References

    1. Chai E.O.K. University of Edinburgh; Malaysia: 1998. Aspects of a Tree Improvement Programme for Shorea Macrophylla (de Vriese) Ashton in Sarawak.
    1. Ashton P.S. Dipterocarpaceae. Flora Malesiana. 1982;9:237–552.
    1. Yu Y., Han Y., Peng Y., Tian Z., Zeng P., Zong H., Zhou T., Cai J. Comparative and phylogenetic analyses of eleven complete chloroplast genomes of Dipterocarpoideae. Chinese Med. (United Kingdom) 2021;16:1–15. doi: 10.1186/s13020-021-00538-8. - DOI - PMC - PubMed
    1. Alzahrani D., Albokhari E., Abba A., Yaradua S. The first complete chloroplast genome sequences in Resedaceae: Genome structure and comparative analysis. Sci. Prog. 2021;104:1–18. doi: 10.1177/00368504211059973. - DOI - PMC - PubMed
    1. Qian J., Song J., Gao H., Zhu Y., Xu J., Pang X., Yao H., Sun C., Li X., Li C., Liu J., Xu H., Chen S. The Complete Chloroplast Genome Sequence of the Medicinal Plant Salvia miltiorrhiza. PLoS One. 2013;8 doi: 10.1371/journal.pone.0057607. - DOI - PMC - PubMed