Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar 6;10(3):e1004169.
doi: 10.1371/journal.pgen.1004169. eCollection 2014 Mar.

GC-rich DNA elements enable replication origin activity in the methylotrophic yeast Pichia pastoris

Affiliations

GC-rich DNA elements enable replication origin activity in the methylotrophic yeast Pichia pastoris

Ivan Liachko et al. PLoS Genet. .

Abstract

The well-studied DNA replication origins of the model budding and fission yeasts are A/T-rich elements. However, unlike their yeast counterparts, both plant and metazoan origins are G/C-rich and are associated with transcription start sites. Here we show that an industrially important methylotrophic budding yeast, Pichia pastoris, simultaneously employs at least two types of replication origins--a G/C-rich type associated with transcription start sites and an A/T-rich type more reminiscent of typical budding and fission yeast origins. We used a suite of massively parallel sequencing tools to map and dissect P. pastoris origins comprehensively, to measure their replication dynamics, and to assay the global positioning of nucleosomes across the genome. Our results suggest that some functional overlap exists between promoter sequences and G/C-rich replication origins in P. pastoris and imply an evolutionary bifurcation of the modes of replication initiation.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Mapping of replication origins in P. pastoris.
(A) Schematic of ARS-seq and miniARS-seq screens. Fragmented genomic DNA was ligated into non-replicating URA3 vectors and screened for ARS activity followed by deep sequencing of the resultant plasmid inserts (ARS-seq, top). ARS-seq plasmid inserts were amplified and sheared using DNase I. Short fragments of ARSs were ligated into the URA3 vectors and screened for ARS activity followed by deep sequencing of the plasmid inserts (miniARS-seq, bottom). (B) The GC-ACS motif identified by the MEME algorithm. (C) The distribution of MAST motif scores of the best match to the GC-ACS in every PpARS. (D) 2D gel analysis at loci A2772 (putative AT-ARS at chromosome 1: 2,772 kb) and C379 (putative GC-ARS at chromosome 3: 379 kb). The red arrows highlight arcs corresponding to replication bubble intermediates.
Figure 2
Figure 2. The GC-ACS is required for GC-ARS function.
Wild type (WT) and mutant (MUT) alleles of the twelve ARSs indicated were cloned into a URA3 ARS-less vector and used to transform ura3 yeast on selective medium plates lacking uracil. Plates were grown at 30°C for five days before pictures were taken. Colony formation indicates plasmid maintenance and ARS activity. The GC-ACS was positioned <15 bp away from the 5′ endpoint in all ARS sequences. The sequences of the fragments tested are listed in Table S4.
Figure 3
Figure 3. Deep mutational scanning of P. pastoris ARSs.
(A) Schematic of the mutARS-seq deep mutational scanning experiment. Auxotrophic ura3 yeast were transformed with a library of mutant ARS variants and competed in selective medium. The abundance of different ARS variants was determined by deep sequencing at intervals during competitive growth. (B) Results of mutARS-seq of ARS-C379. The relevant sequence of ARS-C379 is shown with the best match to the GC-ACS motif highlighted in red (and a 3′ constrained dinucleotide highlighted in blue). The log-transformed enrichment ratio is shown for each nucleotide at each position along the sequence. (C) Results of mutARS-seq of ARS-A2772. Same as in (B), except that the motif logo shown was constructed from the enrichment ratio scores post-analysis, whereas the motif shown in (B) was constructed from ARS alignments.
Figure 4
Figure 4. Replication timing of the P. pastoris genome.
(A) Genomic DNA from G1 and S phase cells was sheared and sequenced. Normalized S/G1 DNA copy ratios (in 1 kbp windows) were smoothed and plotted against chromosomal coordinates. Peaks correspond to positions of replication initiation. The profile of chromosome 4 is shown (all chromosomes are shown in Figure S6) with ARS locations indicated by open (AT-ARSs) and shaded (GC-ARSs) circles. Un-smoothed ratio data for one of the replicates is shown are grey. Coordinates of replication timing peaks are indicated by dashed vertical lines. (B) The distributions of smoothed S/G1 ratio data. The distribution of all ratios (“Genome”) is shown adjacent to the distribution of values at bins containing midpoints of GC-ACSs (“GC”) or AT-ARSs (“AT”). Values for ARSs that have no other ARSs within 40 kb in both directions are shown on the right (“isolated”). (C) The complete genomic ratio distribution is shown relative to distributions after removal of data within 60 kb ranges centered on AT-ARSs (“AT”), GC-ARSs (“GC”), or all ARSs (“all ARS”). (D) For each ARS, the distance to the nearest replication peak was calculated. The ARS-peak distances are shown as distributions separately for GC-ARSs (blue) and AT-ARSs (orange). Peak distances from simulated random sets of loci are shown in grey.
Figure 5
Figure 5. Nucleosome profile of P. pastoris.
Nucleosome density is plotted for sites centered on all TSSs as a control to test the overall quality of the mapping data (left), non-overlapping GC-ARS sites with a single match to the GC-ACS (middle), or the A/T-rich motif shown in Figure 3C (right). TSS sites are ranked based on expression in the SDEG condition . GC-ARS and AT-ARS sites are ranked by the strength of the best match to the G/C- and the A/T-rich motif respectively.
Figure 6
Figure 6. Sequence features of GC-ARSs.
(A) Average nucleotide frequencies around 107 GC-ARS sites (top) and twenty-eight non-ARS intergenic occurrences of the GC-ACS (bottom), centered on the best match of the GC-ACS. The nucleotide frequencies are calculated at all flanking regions around the motif independent of whether the flanking region is present in ARS contigs or cores. (B) The distribution of distances between the GC-ACS motif (in the orientation shown) and the TSS for adjacent genes transcribing away from the ARS with available TSS annotations. Distances to the 5′ side of the motif are shown in blue; distances to the 3′ side of the motif are shown in red. (C) The distribution of sequence lengths between the GC-ACS and the end of the inferred functional core region for each GC-ARS. The 5′ distance is indicated in blue; the 3′ distance is indicated in red. Numbers indicate the upper limit of the bin.

References

    1. Méchali M (2010) Eukaryotic DNA replication origins: many choices for appropriate answers. Nat Rev Mol Cell Biol 11: 728–738 doi:10.1038/nrm2976 - DOI - PubMed
    1. Eaton ML, Prinz JA, MacAlpine HK, Tretyakov G, Kharchenko PV, et al. (2011) Chromatin signatures of the Drosophila replication program. Genome Res 21: 164–174 doi:10.1101/gr.116038.110 - DOI - PMC - PubMed
    1. Dellino GI, Cittaro D, Piccioni R, Luzi L, Banfi S, et al. (2013) Genome-wide mapping of human DNA-replication origins: Levels of transcription at ORC1 sites regulate origin selection and replication timing. Genome Res 23: 1–11 doi:10.1101/gr.142331.112 - DOI - PMC - PubMed
    1. Costas C, la Paz Sanchez de M, Stroud H, Yu Y, Oliveros JC, et al. (2011) Genome-wide mapping of Arabidopsis thaliana origins of DNA replication and their associated epigenetic marks. Nat Struct Mol Biol 18: 395–400 doi:10.1038/nsmb.1988 - DOI - PMC - PubMed
    1. Hansen RS, Thomas S, Sandstrom R, Canfield TK, Thurman RE, et al. (2010) Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci USA 107: 139–144 doi:10.1073/pnas.0912402107 - DOI - PMC - PubMed

Publication types