Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Sep 5;4(9):e1000175.
doi: 10.1371/journal.pgen.1000175.

Segmental duplications arise from Pol32-dependent repair of broken forks through two alternative replication-based mechanisms

Affiliations

Segmental duplications arise from Pol32-dependent repair of broken forks through two alternative replication-based mechanisms

Celia Payen et al. PLoS Genet. .

Abstract

The propensity of segmental duplications (SDs) to promote genomic instability is of increasing interest since their involvement in numerous human genomic diseases and cancers was revealed. However, the mechanism(s) responsible for their appearance remain mostly speculative. Here, we show that in budding yeast, replication accidents, which are most likely transformed into broken forks, play a causal role in the formation of SDs. The Pol32 subunit of the major replicative polymerase Poldelta is required for all SD formation, demonstrating that SDs result from untimely DNA synthesis rather than from unequal crossing-over. Although Pol32 is known to be required for classical (Rad52-dependant) break-induced replication, only half of the SDs can be attributed to this mechanism. The remaining SDs are generated through a Rad52-independent mechanism of template switching between microsatellites or microhomologous sequences. This new mechanism, named microhomology/microsatellite-induced replication (MMIR), differs from all known DNA double-strand break repair pathways, as MMIR-mediated duplications still occur in the combined absence of homologous recombination, microhomology-mediated, and nonhomologous end joining machineries. The interplay between these two replication-based pathways explains important features of higher eukaryotic genomes, such as the strong, but not strict, association between SDs and transposable elements, as well as the frequent formation of oncogenic fusion genes generating protein innovations at SD junctions.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Segmental duplication assays.
(A) Growth recovery assay . Black circles and triangles represent centromeres and telomeres, respectively. White open arrow represents the RPL20B gene (YOR312C) whose duplication is selected for. Yellow and pink boxes denote intra- (left) and one type of inter-chromosomal (right) duplications, respectively. A non-reciprocal translocation event between the right arm of chromosome XV and another chromosome (denoted “n”) is represented: for other types of inter-chromosomal SD (i.e. chimerical supernumerary chromosome and unequal reciprocal translocation, see and [21]). SD size ranges are indicated below the double-headed arrows. (B) Uracil prototrophy recovery assay. Top: schematic representation of the right arm of chromosome XV spanning the RPL20B locus and the two flanking Ty3 LTRs (YORWsigma3 and YORWsigma4) located 115 kb apart from each others. 5′- and 3′-truncated are either inserted next (YORWsigma3) or replaces (YORWsigma4) Ty3 sequences. The “R”-labeled red box indicates the 58 or 401 bp overlap between the two truncated URA3 cassettes. Bottom: a functional URA3 gene restoring uracil prototrophy is generated through 115 kb direct-tandem duplication events involving the overlapping sequences. (C) Size distribution of intra-chromosomal SDs. The x and y-axis of the diagram indicate the strain background and the percentage of events recovered, respectively. Yellow, violet and blue bars represent the proportion of duplications larger, equal to and smaller than 115 kb, respectively (with the actual number of events analyzed indicated in the table below). (D) Phenomenology of SD formation. Protein names involved in the different steps are indicated to the left of the diagram. Red, orange and blue names represent proteins whose deletions abolish, reduce and increase SD formation, respectively. Light and medium grey boxes indicate the two alternative mechanisms of SD formation, BIR (Break-induced Replication) and MMIR (Microhomology/Microsatellite-induced Replication), respectively. CPT = camptothecin.
Figure 2
Figure 2. Representative breakpoint sequences of non LTR-mediated duplications.
Only events leading to chimerical ORF are presented. a WT junctions are from . Top and bottom sequences correspond to centromere-distal and -proximal sequences, respectively, followed by the name of the genetic element involved at the junctions. Shaded areas indicate the regions of sequence identity shared by these two sequences and correspond to the breakpoint per se. The coordinates in brackets correspond to the first nucleotide position within the shaded areas. For each strain, the middle sequence corresponds to the actual breakpoint sequence followed by a description of the chimerical genetic element recovered at the junction. Neighboring elements correspond to sequences known to participate or interfere with replication, with slow-zone corresponding to inflection point in the replication pattern (i.e. regions where fork progression slows down, [67]), ter site to termination regions, and ARS to autonomous replicating sequences. On the right, the schematic representations with orientated grey and black boxes represent the structure of the chimerical elements generated with the sizes of the corresponding chimerical ORFs (in aa). The contribution (in aa) of each of the two elements involved in the fusion is indicated above and below the corresponding boxes. Amino acids encoded by a frame different from that of the original elements are referred as “new”.
Figure 3
Figure 3. Potential interplay between replication factories and SD formation.
The top two drawings illustrate the conservation of the physical distance between two forks within the same replication bubble as elongation proceeds. Red/blue and grey bubbles symbolize replicons located on two different chromosomes but co-localizing within the same replication factory. A broken fork can be repaired either in a Rad52-dependent (i) or in a Rad52-independent (ii) manner. Rad52-dependent annealing could be achieved through interaction with a sequence from either the same or a different replication bubble (symbolized by the sharp sign between the two replication factories) leading to intra- or inter-chromosomal SD. In a rad52Δ strain, SDs are on average shorter and almost exclusively intra-chromosomal (bottom right schematic representation), suggesting a Rad52-independent preferential association between sequences originating from the same replication bubble during the annealing step. A schematic representation of the resulting duplications is presented at the bottom.

Similar articles

Cited by

References

    1. Bailey JA, Eichler EE. Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet. 2006;7:552–564. - PubMed
    1. Sharp AJ, Hansen S, Selzer RR, Cheng Z, Regan R, et al. Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome. Nat Genet. 2006;38:1038–1042. - PubMed
    1. Courseaux A, Nahon JL. Birth of two chimeric genes in the Hominidae lineage. Science. 2001;291:1293–1297. - PubMed
    1. Courseaux A, Richard F, Grosgeorge J, Ortola C, Viale A, et al. Segmental duplications in euchromatic regions of human chromosome 5: a source of evolutionary instability and transcriptional innovation. Genome Res. 2003;13:369–381. - PMC - PubMed
    1. Inoue K, Dewar K, Katsanis N, Reiter LT, Lander ES, et al. The 1.4-Mb CMT1A duplication/HNPP deletion genomic region reveals unique genome architectural features and provides insights into the recent evolution of new genes. Genome Res. 2001;11:1018–1033. - PMC - PubMed

Publication types

MeSH terms

Substances