Comparative Study

. 2005 May;15(5):616-28.

doi: 10.1101/gr.3788705.

Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants

Christos Noutsos¹, Erik Richly, Dario Leister

Affiliations

PMID: 15867426
PMCID: PMC1088290
DOI: 10.1101/gr.3788705

Comparative Study

Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants

Christos Noutsos et al. Genome Res. 2005 May.

. 2005 May;15(5):616-28.

doi: 10.1101/gr.3788705.

Authors

Christos Noutsos¹, Erik Richly, Dario Leister

Affiliation

¹ Abteilung für Pflanzenzüchtung und Genetik, Max-Planck-Institut für Züchtungsforschung, D-50829 Köln, Germany.

PMID: 15867426
PMCID: PMC1088290
DOI: 10.1101/gr.3788705

Abstract

Nuclear genomes are exposed to a continuous influx of DNA from mitochondria and plastids. We have characterized the structure of approximately 750 kb of organelle DNA, distributed among 13 loci, in the nuclear genomes of Arabidopsis and rice. These segments are large and migrated to the nucleus quite recently, allowing us to reconstruct their evolution. Two general types of nuclear insertions coexist; one is characterized by long sequence stretches that are colinear with organelle DNA, the other type consists of mosaics of organelle DNA, often derived from both plastids and mitochondria. The levels of sequence divergence of the two types exclude their common descent, implying that at least two independent modes of DNA transfer from organelle to nucleus operate. The post-integration fate of organelle DNA is characterized by a predominance of transition mutations, associated with the gradual amelioration of the integrated sequence to the nucleotide composition of the host chromosome. Deletion of organelle DNA at these loci is essentially balanced by insertions of nonorganelle DNA. Deletions are associated with the removal of DNA between perfect repeats, indicating that they originate by replication slippage.

PubMed Disclaimer

Figures

**Figure 1.**
Structure of long continuous insertions of ptDNA in the nuclear genome of *O. sativa*. (A) Structure of rice ptDNA (Os-pt). The position of the long (LSC) and short (SSC) single-copy regions, as well as of the two inverted repeats (IR_A and IR_B) are indicated. The four nuclear insertions (see *bottom*) are depicted as black lines below their regions of origin in the rice plastid chromosome. When the origin of a fragment of nuclear ptDNA was ambiguous, i.e., when derived from repetitive regions, it was assigned to the repeat located most 5′ of the organellar DNA. For instance, NUPTs solely homologous to IR-specific sequences were always assigned to IR_B.(B) Structure of the four nuclear insertions of rice ptDNA , , and . Coloring and numbers indicate the position of the homologous sequence in the plastid chromosome based on the structure reported in A. The orientation of NUPTs relative to ptDNA is indicated by arrowheads; arrowheads pointing to the *left* indicate a reverse orientation. In the case of the IR regions of the ptDNA, the position of both homologous ptDNA sequences is provided. Fragments of ptDNA deleted from the nuclear integration are indicated by triangles with the size of the deletion indicated (e.g. “–14”) (see Table 3); short insertions or duplications are indicated accordingly (e.g. “+45”; duplications are highlighted by an asterisk; Supplemental Table 2). For , the long insertion of nonorganelle DNA is indicated by a black line and the letter “i” (Supplemental Table 1).

formula image — **Figure 1.**
Structure of long continuous insertions of ptDNA in the nuclear genome of *O. sativa*. (A) Structure of rice ptDNA (Os-pt). The position of the long (LSC) and short (SSC) single-copy regions, as well as of the two inverted repeats (IR_A and IR_B) are indicated. The four nuclear insertions (see *bottom*) are depicted as black lines below their regions of origin in the rice plastid chromosome. When the origin of a fragment of nuclear ptDNA was ambiguous, i.e., when derived from repetitive regions, it was assigned to the repeat located most 5′ of the organellar DNA. For instance, NUPTs solely homologous to IR-specific sequences were always assigned to IR_B.(B) Structure of the four nuclear insertions of rice ptDNA , , and . Coloring and numbers indicate the position of the homologous sequence in the plastid chromosome based on the structure reported in A. The orientation of NUPTs relative to ptDNA is indicated by arrowheads; arrowheads pointing to the *left* indicate a reverse orientation. In the case of the IR regions of the ptDNA, the position of both homologous ptDNA sequences is provided. Fragments of ptDNA deleted from the nuclear integration are indicated by triangles with the size of the deletion indicated (e.g. “–14”) (see Table 3); short insertions or duplications are indicated accordingly (e.g. “+45”; duplications are highlighted by an asterisk; Supplemental Table 2). For , the long insertion of nonorganelle DNA is indicated by a black line and the letter “i” (Supplemental Table 1).

**Figure 2.**
Structures of the complex integrants and , which contain rearranged DNA from the rice plastid chromosome. A and B are depicted as in Figure 1. For , the two large insertions of nonorganelle DNA are designated as “i1” and “i2” (Supplemental Table 1).

**Figure 3.**
Structures of the complex integrants and , which contain rearranged DNA from the mitochondrial chromosome of *Arabidopsis* and rice, respectively. (A) Structure of the *Arabidopsis* (At-mt) and rice (Os-mt) mtDNAs. For the *Arabidopsis* mtDNA, the position of the four single-copy regions A, B, C, and D, as well as the three pairs of specific repeats (I, II, and III), are presented. Repeats I (positions 44,698–48,894 and 178,863–183,059) and II (103,805–104,337 and 227,087–227,619) are directly oriented, while the two repeat III sequences (112,147–118,736 and 297,580–290,991) are inverted. The portion of the mitochondrial chromosome included in each nuclear insertion (*bottom*) is indicated by black lines (as in the previous figures). (B) Structure of and , depicted according to the scheme used in Figures 1 and 2. For , small NUMTs are indicated by Arabic numerals and the positions of their homologs in the *Arabidopsis* mtDNA are as follows: 1 (170,597–170,530); 2 (170,530–170,597); 3 (191,246–191,300); 4 (25,397–25,647); 5 (28,644–28,687); 6 (279,338–279,090); 7 (280,258–279,809); and 8 (191,552–192,070). Four major rearrangements can be recognized (see text). (1) A 5.5-kb region, derived from the D-region, is duplicated (positions 1–5545 and 47,106–52,650) in . The two sequences are more similar to each other than to mtDNA, and both harbor a characteristic 68-bp insertion derived from region C (designated as “1” and “2”), suggesting that the insertion of this specific mtDNA segment in the nucleus occurred before the duplication. (2) A 2.6-kb region (positions 5546–8207), derived from region C, is inserted between the D′ terminus of the mtDNA insertion (see above) and the 5.5-kb duplication described above. (3) A 1.8-kb stretch, containing a structure of six short NUMTs, together with very short stretches of nonorganelle DNA, is present at positions 74,295–76,084. The six NUMTs derive from all four major regions of the mtDNA (including those absent at the A/D junction: see text and below). The 1.8-kb insertion is flanked by a duplication of 9 bp (CTTTACGAG) present in the D region, implying that it was inserted after formation of the D′–A′–C–B structure. (4) At positions 129,022–129,597, between A region and III-type repeat, a short stretch of the B region is found. This could be the result of imprecise homologous recombination between repeat III sequences, affecting the B region adjacent to one of the repeats. Alternatively, this short region might represent the beginning of the 350-kb duplication that was previously detected in , but not sequenced (Stupar et al. 2001). For , the two large insertions of nonorganelle DNA into nuclear DNA of organellar origin are designated as “i1” and “i2” (Supplemental Table 1). (C) Rearrangements of the *Arabidopsis* mtDNA due to recombination across repeat regions. The four single-copy regions A, B, C, and D are separated by two pairs of repeats (I: direct repeats, III: inverted repeats); recombinations involving repeat II sequences are not shown because they are not relevant for the generation of . Recombination across repeat II sequences results in A–D and B–C circles, while the A–B–D′–C′ structure derives from III–III recombination (D′ and C′ refer to inverted D and C sequences, respectively). The A–D–B′–C′ arrangement originates from the normal A–B–C–D structure by two rounds of recombination; the first, between the pair of repeat III sequences, inverts the orientation of one of the repeat I sequences, allowing a second recombination between the now inverted repeat I sequences, resulting in an A–D–B′–C′ circle. Another alternative structure, A–C′–B–D′, has been described before (Klein et al. 1994; Unseld et al. 1997). (D) Origin of . The structure of is depicted as a circle around mtDNA of the A–D–B′–C′-type. The insertion contains the four major segments of mtDNA in the order D′–A′–C–B, whereby parts of D′ and A′ are either absent in the nuclear mtDNA sequence, or form part of the 350-kb duplication for which no sequence information is available (Stupar et al. 2001). The deletion at the A/D junction is indicated by the dotted segment of the circle and contains mtDNA from position 183,060–209, 928 (region D), an entire I-type repeat, and the region extending from 343,608 to 44,697 (from region A) (see A). The breakpoint (indicated by an asterisk) that gave rise to the linear D′–A′–C–B structure should be located in region D, close to the type III repeat; in fact, both ends of consist of sequences from the D region. Additional rearrangements of , in particular the duplication resulting in the 350-kb region identified before (Stupar et al. 2001), are indicated by arrows.

**Figure 4.**
Structure of the complex insertions , , , and , which contain rearranged DNA from both ptDNA and mtDNA of (A) The structures of mtDNA and ptDNA and the portions found in the five insertions (see B) are shown, following the scheme used in the previous figures. In B, the structures of the five nuclear insertions containing both ptDNA and mtDNA (, , , and ) are shown. For the four large inserts of nonorganelle DNA are designated as “i1” to “i4”, and for and as “i” (Supplemental Table 1).

**Figure 5.**
Relationships between base composition, structure, and sequence divergence of large insertions of nuclear organelle DNA. (A) The G/C content of the insertion comes to resemble that of its nuclear chromosomal neighborhood. The primary insertion is assumed to have had the same G/C content as the corresponding region of pt/mtDNA. The G/C contents of the chromosomal regions hosting the insertions are based on those of the 300 kb of nuclear DNA immediately flanking the respective insertion. White boxes indicate segments with an overall similarity of >99.5% to pt/mtDNA; the insertions indicated as gray boxes are less similar to organellar DNA. The two integrants with the highest transition/transversion ratios are indicated in bold. (B) Relationship between average fragment size and sequence divergence. The level of sequence identity between insertion and organellar DNA was calculated using the BESTFIT algorithm of the GCG package, which considers both nucleotide exchanges and InDels (Devereux et al. 1984). Average fragment sizes were obtained from columns 2 and 4 of Table 1. Squares indicate long continuous integrants; open circles symbolize complex insertions derived from one organelle; and shaded circles stand for complex insertions, including sequences from both organelles. (C) Intra-insertion patterns of sequence divergence. Sequence identity between NUPTs and ptDNA and between NUMTs and mtDNA was assessed for the five largest complex loci as in B. NUPTs and NUMTs larger than 1 kb were considered. Insertion served as control. It consists of one large continuous NUPT subdivided into sets of fragments with sizes ranging from 20 kp to 1 kb; the level of its overall sequence identity is indicated by a gray line. In , NUPTs are indicated by continuous lines and NUMTs by dotted lines. Note that the more diverged fragments in (26,331–31,363; 31,686–45,533; 46,267–53,135) are mostly derived from the IR region; in (55,367–56,927; 66,220–68,636; 75,288–81,711; 81,712–83,231) they originate from the IR region, and from positions 55,000–60,000 of ptDNA. The most diverged regions in (10,721–16,939; 16,940–20,685) derive again from IR sequences.

See this image and copyright information in PMC

Cited by

Environmental stress increases the entry of cytoplasmic organellar DNA into the nucleus in plants.
Wang D, Lloyd AH, Timmis JN. Wang D, et al. Proc Natl Acad Sci U S A. 2012 Feb 14;109(7):2444-8. doi: 10.1073/pnas.1117890109. Epub 2012 Jan 30. Proc Natl Acad Sci U S A. 2012. PMID: 22308419 Free PMC article.
Organellar genome analysis reveals endosymbiotic gene transfers in tomato.
Kim HT, Lee JM. Kim HT, et al. PLoS One. 2018 Sep 5;13(9):e0202279. doi: 10.1371/journal.pone.0202279. eCollection 2018. PLoS One. 2018. PMID: 30183712 Free PMC article.
Mitochondrial DNA transfer to the nucleus generates extensive insertion site variation in maize.
Lough AN, Roark LM, Kato A, Ream TS, Lamb JC, Birchler JA, Newton KJ. Lough AN, et al. Genetics. 2008 Jan;178(1):47-55. doi: 10.1534/genetics.107.079624. Genetics. 2008. PMID: 18202357 Free PMC article.
Nuclear DNA segments homologous to mitochondrial DNA are obstacles for detecting heteroplasmy in sugar beet (Beta vulgaris L.).
Taniguchi E, Satoh K, Ohkubo M, Ue S, Matsuhira H, Kuroda Y, Kubo T, Kitazaki K. Taniguchi E, et al. PLoS One. 2023 Aug 8;18(8):e0285430. doi: 10.1371/journal.pone.0285430. eCollection 2023. PLoS One. 2023. PMID: 37552681 Free PMC article.
Chimeric deletion mutation of rpoC2 underlies the leaf-patterning of Clivia miniata var. variegata.
Wu Y, Zheng Y, Xu W, Zhang Z, Li L, Wang Y, Cui J, Wang QM. Wu Y, et al. Plant Cell Rep. 2023 Sep;42(9):1419-1431. doi: 10.1007/s00299-023-03039-0. Epub 2023 Jun 16. Plant Cell Rep. 2023. PMID: 37326841

See all "Cited by" articles

References

1. Adams, K.L. and Palmer, J.D. 2003. Evolution of mitochondrial gene content: Gene loss and transfer to the nucleus. Mol. Phylogenet. Evol. 29: 380–395. - PubMed
1. Adams, K.L., Daley, D.O., Qiu, Y.L., Whelan, J., and Palmer, J.D. 2000. Repeated, recent and diverse transfers of a mitochondrial gene to the nucleus in flowering plants. Nature 408: 354–357. - PubMed
1. Adams, K.L., Rosenblueth, M., Qiu, Y.L., and Palmer, J.D. 2001. Multiple losses and transfers to the nucleus of two mitochondrial succinate dehydrogenase genes during angiosperm evolution. Genetics 158: 1289–1300. - PMC - PubMed
1. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403–410. - PubMed
1. Ayliffe, M.A., Scott, N.S., and Timmis, J.N. 1998. Analysis of plastid DNA-like sequences within the nuclear genomes of higher plants. Mol. Biol. Evol. 15: 738–745. - PubMed

Web site references

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants

Affiliation

Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Web site references

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Abstract

Figures

Similar articles

Cited by

References

Web site references

Publication types

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources