Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 7;120(45):e2308569120.
doi: 10.1073/pnas.2308569120. Epub 2023 Nov 2.

Massive invasion of organellar DNA drives nuclear genome evolution in Toxoplasma

Affiliations

Massive invasion of organellar DNA drives nuclear genome evolution in Toxoplasma

Sivaranjani Namasivayam et al. Proc Natl Acad Sci U S A. .

Abstract

Toxoplasma gondii is a zoonotic protist pathogen that infects up to one third of the human population. This apicomplexan parasite contains three genome sequences: nuclear (65 Mb); plastid organellar, ptDNA (35 kb); and mitochondrial organellar, mtDNA (5.9 kb of non-repetitive sequence). We find that the nuclear genome contains a significant amount of NUMTs (nuclear integrants of mitochondrial DNA) and NUPTs (nuclear integrants of plastid DNA) that are continuously acquired and represent a significant source of intraspecific genetic variation. NUOT (nuclear DNA of organellar origin) accretion has generated 1.6% of the extant T. gondii ME49 nuclear genome-the highest fraction ever reported in any organism. NUOTs are primarily found in organisms that retain the non-homologous end-joining repair pathway. Significant movement of organellar DNA was experimentally captured via amplicon sequencing of a CRISPR-induced double-strand break in non-homologous end-joining repair competent, but not ku80 mutant, Toxoplasma parasites. Comparisons with Neospora caninum, a species that diverged from Toxoplasma ~28 mya, revealed that the movement and fixation of five NUMTs predates the split of the two genera. This unexpected level of NUMT conservation suggests evolutionary constraint for cellular function. Most NUMT insertions reside within (60%) or nearby genes (23% within 1.5 kb), and reporter assays indicate that some NUMTs have the ability to function as cis-regulatory elements modulating gene expression. Together, these findings portray a role for organellar sequence insertion in dynamically shaping the genomic architecture and likely contributing to adaptation and phenotypic changes in this important human pathogen.

Keywords: Coccidia; non-homologous end-joining repair—NHEJ; nuclear DNA of organellar origin—NUOT; nuclear integrants of mitochondrial DNA—NUMTs; nuclear integrants of plastid DNA—NUPTs.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Fig. 1.
Fig. 1.
Characteristics of NUMTs and NUPTs in T. gondii ME49 (A) Cladogram of apicomplexan parasite relationships (B) Circos plot (Circos version 0.51) represents the distribution of NUMTs and NUPTs in the nuclear chromosomes of T. gondii ME49. The outer circle of yellow bands indicates the chromosomes as labeled, and each tick on the band equals 100 kb. The ticks interior to the chromosomes are different features as indicated in the key. (C) The length and the percent identity of a NUMT and NUPT to mt and ptDNA, respectively, were calculated from RepeatMasker results, and the distribution is plotted. (D) The age of each NUMT and NUPT was calculated based on the percent divergence from the mt and ptDNA of the corresponding species, and the distribution of the age is plotted against the number of base pairs.
Fig. 2.
Fig. 2.
Strain-specific presence and absence of NUMTs in T. gondii. (A and B) A likely strain-specific insertion in T. gondii (TG) GT1 (A) and deletion in ME49 (B) are shown. A multiple sequence alignment of these loci was performed for strains GT1, VEG, and ME49, and the co-ordinates of the loci are indicated with reference with GT1 chromosome VI. The NUMT and flanking nuclear sequence are indicated in red and black, respectively. (C) NUMTs in 16 T. gondii strains were compared to identify a strain-specific presence and absence. Strains are grouped and colored by clades as previously identified (7). The 14 chromosomes divided by black lines are represented on the x-axis. NUMTs that show differential presence/absence in the 16 T. gondii strains are numbered and ordered based on their location in ME49. The coordinates of each NUMT plus 200-bp flanking region on the ME49 chromosomes are shown at the bottom. The 200-bp regions were included to provide a location for NUMTs missing in ME49. The percent divergence of the NUMT from the mtDNA sequence is indicated above the coordinates. In the matrix, “+” and “−” indicate the presence and absence of a NUMT in a particular strain, respectively. Each cell is colored based on local clade ancestry of that location (7) although it does not imply that location originated from that clade. Yellow cells indicate clade E ancestry and were not included the NUMT analysis as an assembled genome sequence for a strain from this clade was not available.
Fig. 3.
Fig. 3.
Cis-regulatory activity of two selected NUMTs. (A and B) The structure of WT and NUMT deletion promoter constructs for U6 snRNA-associated Sm family protein gene TGME49_286560 (A) and myosin heavy chain gene TGME49_254850 (B). Nucleotide positions are referenced with respect to the start of translation (+1) of host gene, and the red region indicates the position of the NUMT (SI Appendix, Fig. S4). (C and D) Reporter assay results for the promoter of Sm-like protein (C) and myosin heavy chain (D) genes. The graphs depict luciferase activity as ratios of Firefly:Renilla activity in relative luciferase units (RLU) from the different constructs containing either WT or mutagenized promoter. All luciferase readings are relative to an internal control (α-tubulin-Renilla). Error bars represent SE calculated across the means of six independent electroporations. P-values were calculated using Student’s t-test.
Fig. 4.
Fig. 4.
Capture of DNA insertions at CRISPR/Cas9-directed DSB at the uprt locus in wild-type and Δku80 parasites. Utilizing the CRISPR-Cas9 system, DSBs were induced at the uprt locus in wild-type T. gondii and Δku80 parasites. The locus was then amplified and deep-sequenced to evaluate the extrachromosomal DNA insertions (mtDNA, ptDNA, and CRISPR-Cas9 plasmid DNA) at the DSB site. (A) Apicomplexan cladogram with the presence (+) or absence (−) of key NHEJ pathway genes and NUOT content in each species indicated. The coccidian branch is shown in bold. (B and C) The percent of amplicons with insertions from each DNA type (B) and the percentage of base pairs from each type present in the amplicons (C) are plotted for WT and Δku80 parasites. The amplicons analyzed comprise all read pairs that could be merged as well as unmerged reads. Error bars represent SE of means of three replicates. Statistical significance was calculated using Student’s t-test (*P < 0.05, **P < 0.01, ***P < 0.001). (D and E) Distribution of the insert length for each DNA type is plotted as a proportion (D) and as box plots (E). The data were generated from merged read pairs for one WT replicate. Color key is as shown in (B). Statistical significance was calculated using Student’s t-test (****P < 0.0001). (F) Schematic representation of the modifications observed in the amplicon reads for WT parasites. The numbers under “WT amplicon” indicate the length of the amplicon with the inverted triangle denoting the approximate site of the DSB. The type of DNA inserted is indicated in the key in (A). Not all combinations of insertion patterns observed in the reads are represented. Examples of reads with the kind of modifications shown in this schematic are provided in SI Appendix, Table S12. Amplicons with WT sequence (yellow) on both ends represent reads where the pairs could be merged. Numerous unmerged read with similar insertion patterns as shown in this figure exist but are not drawn here.

Update of

References

    1. Tenter A. M., Heckeroth A. R., Weiss L. M., Toxoplasma gondii: From animals to humans. Int. J. Parasitol. 30, 1217–1258 (2000). - PMC - PubMed
    1. Dubey J. P., Toxoplasmosis of Animals and Humans (CRC Press, ed. 3, 2021).
    1. Robert-gangneux F., Darde M. L., Epidemiology of and diagnostic strategies for toxoplasmosis. Clin. Microbiol. Rev. 25, 264–296 (2012). - PMC - PubMed
    1. Belanger F., Derouin F., Grangeot-keros L., Meyer L., Incidence and risk factors of toxoplasmosis in a cohort of human immunodeficiency virus-infected patients: 1988–1995. HEMOCO and SEROCO Study Groups. Clin. Infect. Dis. 28, 575–581 (1999). - PubMed
    1. Gilbert R. E., et al. , Effect of prenatal treatment on mother to child transmission of Toxoplasma gondii: Retrospective cohort study of 554 mother-child pairs in Lyon, France. Int. J. Epidemiol. 30, 1303–1308 (2001). - PubMed

Substances