Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Feb 2;8(2):385-391.
doi: 10.1534/g3.117.300331.

Genome Assembly and Annotation of the Medicinal Plant Calotropis gigantea, a Producer of Anticancer and Antimalarial Cardenolides

Affiliations

Genome Assembly and Annotation of the Medicinal Plant Calotropis gigantea, a Producer of Anticancer and Antimalarial Cardenolides

Genevieve M Hoopes et al. G3 (Bethesda). .

Abstract

Calotropis gigantea produces specialized secondary metabolites known as cardenolides, which have anticancer and antimalarial properties. Although transcriptomic studies have been conducted in other cardenolide-producing species, no nuclear genome assembly for an Asterid cardenolide-producing species has been reported to date. A high-quality de novo assembly was generated for C. gigantea, representing 157,284,427 bp with an N50 scaffold size of 805,959 bp, for which quality assessments indicated a near complete representation of the genic space. Transcriptome data in the form of RNA-sequencing libraries from a developmental tissue series was generated to aid the annotation and construction of a gene expression atlas. Using an ab initio and evidence-driven gene annotation pipeline, 18,197 high-confidence genes were annotated. Homologous and syntenic relationships between C. gigantea and other species within the Apocynaceae family confirmed previously identified evolutionary relationships, and suggest the emergence or loss of the specialized cardenolide metabolites after the divergence of the Apocynaceae subfamilies. The C. gigantea genome assembly, annotation, and RNA-sequencing data provide a novel resource to study the cardenolide biosynthesis pathway, especially for understanding the evolutionary origin of cardenolides and the engineering of cardenolide production in heterologous organisms for existing and novel pharmaceutical applications.

Keywords: Apocynaceae family; Calotropis gigantea; Genome Report; cardenolide; genome assembly; pharmaceutical.

PubMed Disclaimer

Figures

Figure 1
Figure 1
C. gigantea and cardenolide metabolites. (A) C. gigantea plant. (B) Plant tissues used for RNA-sequencing libraries. 1, closed bud 2; sepals and petals; 3, peduncle and pedicel; 4, young leaf; and 5, gynostegium. (C) Mature leaf tissue used for RNA-sequencing libraries. (D) Cardenolide metabolite with the carbons numbered. Rx and Ry indicate ambiguity of the attached groups, which vary depending on the specific cardenolide.
Figure 2
Figure 2
Heterozygosity of the C. gigantea genome and assembly. K-mer frequency plots generated in Jellyfish2 (Marçais and Kingsford 2011) using ALLPATHS-LG (Gnerre et al. 2011) error-corrected genomic fragment reads.
Figure 3
Figure 3
Homologous and syntenic relationships among the Apocynaceae family. (A) Venn diagram showing the number of orthologous and paralogous groups shared among the species. The number of singletons per species is also provided. (B) Rooted cladogram for the species generated in the OrthoFinder (Emms and Kelly 2015) output.
Figure 4
Figure 4
Identification of putative C. gigantea genes involved in the cardenolide biosynthetic pathway. (A). Simplified schematic of the cardenolide biosynthetic pathway. (B and C) are neighbor-joining gene trees for 3β-hydroxysteroid dehydrogenase (3βHSD) and progesterone 5β-reductase (P5βR), respectively. Taxa for each tree are the C. gigantea candidates and functionally characterized proteins, for which the taxa labels include the GenBank identifier and species name; taxa are shaded according to their distance from the functionally characterized genes, with darker shades indicating smaller distances. Values on nodes indicate bootstrap support from 1000 bootstrap replicates. (D) Heat map of log2-transformed gene expression values (FPKM: fragments per kb exon model per million mapped reads) of candidate C. gigantea cardenolide biosynthesis genes. Cladograms were generated from conducting hierarchical clustering on the genes and samples. Blue and red colored genes are 3βHSD and P5βR candidates, respectively.

References

    1. Agrawal A. A., Petschenka G., Bingham R. A., Weber M. G., Rasmann S., 2012. Toxic cardenolides: chemical ecology and coevolution of specialized plant-herbivore interactions. New Phytol. 194: 28–45. - PubMed
    1. Amborella Genome Project , 2013. The Amborella genome and the evolution of flowering plants. Science. 342: 1241089. - PubMed
    1. Andrews, S., 2010 FastQC: a quality control tool for high throughput sequence data. Available at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
    1. Bairoch A., Apweiler R., 2000. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28: 45–48. - PMC - PubMed
    1. Bauer P., Munkert J., Brydziun M., Burda E., Müller-Uri F., et al. , 2010. Highly conserved progesterone 5β-reductase genes (P5βR) from 5β-cardenolide-free and 5β-cardenolide-producing angiosperms. Phytochemistry 71: 1495–1505. - PubMed

Publication types

MeSH terms

LinkOut - more resources