Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan;589(7842):462-467.
doi: 10.1038/s41586-020-03052-3. Epub 2020 Dec 16.

Measuring DNA mechanics on the genome scale

Affiliations

Measuring DNA mechanics on the genome scale

Aakash Basu et al. Nature. 2021 Jan.

Abstract

Mechanical deformations of DNA such as bending are ubiquitous and have been implicated in diverse cellular functions1. However, the lack of high-throughput tools to measure the mechanical properties of DNA has limited our understanding of how DNA mechanics influence chromatin transactions across the genome. Here we develop 'loop-seq'-a high-throughput assay to measure the propensity for DNA looping-and determine the intrinsic cyclizabilities of 270,806 50-base-pair DNA fragments that span Saccharomyces cerevisiae chromosome V, other genomic regions, and random sequences. We found sequence-encoded regions of unusually low bendability within nucleosome-depleted regions upstream of transcription start sites (TSSs). Low bendability of linker DNA inhibits nucleosome sliding into the linker by the chromatin remodeller INO80, which explains how INO80 can define nucleosome-depleted regions in the absence of other factors2. Chromosome-wide, nucleosomes were characterized by high DNA bendability near dyads and low bendability near linkers. This contrast increases for deeper gene-body nucleosomes but disappears after random substitution of synonymous codons, which suggests that the evolution of codon choice has been influenced by DNA mechanics around gene-body nucleosomes. Furthermore, we show that local DNA mechanics affect transcription through TSS-proximal nucleosomes. Overall, this genome-scale map of DNA mechanics indicates a 'mechanical code' with broad functional implications.

PubMed Disclaimer

Conflict of interest statement

Author Information Statement: The authors declare no financial or non-financial competing interests. All correspondence and requests for materials should be directed to Taekjip Ha (tjha@jhu.edu).

Figures

Extended Data Fig. 1:
Extended Data Fig. 1:. Timecourse loop-seq.
a, Looping kinetic curves of two individual sequences that were part of the Cerevisiae Nucleosomal Library (Supplementary Note 4), obtained by performing two individual smFRET experiments (Fig. 1a–c) as well as timecourse loop-seq (Supplementary Note 3) on the library. b, Looping times of 10 sequences that were part of the Cerevisiae Nucleosomal Library obtained from 10 individual smFRET experiments (Fig. 1c) vs looping times obtained by performing timecourse loop-seq on the library. Pearson’s r = 0.84. 95% CI = [0.44, 0.96]. p = 0.002 (t-test, two-sided). c, Looping barriers (natural logarithm of the looping times, see Supplementary Note 2) of all 19,907 sequences in the Cerevisiae Nucleosomal Library obtained by performing timecourse loop-seq vs the corresponding cyclizability values obtained by performing regular loop-seq involving 1 minute of DNA looping prior to RecBCD digestion. Pearson’s r = 0.833. 95% CI = [0.829, 0.837]. p < 0.00001 (t-test, two sided).
Extended Data Fig. 2:
Extended Data Fig. 2:. Pre-looped and looped geometries.
Prior to annealing of the ends, the DNA rapidly samples various configurations where the ends are far apart or closer together, described here for simplicity as a rapid equilibrium between two representative states. As described earlier, annealing captures the state where the ends are close together. Thus the rate of looping as measured in the FRET based assay reports on the equilibrium population fraction of the state where the ends are close but not annealed, irrespective of the exact shape and geometry of the subsequent annealed state. It thus addresses the biological question of how quickly regions of DNA can approach, which can then be stabilized by protein binding. However, nucleoprotein complex formation may require not just the ends to approach, but the intervening DNA to also adopt a certain shape, and the readout of looping rate does not distinguish between these possible shapes. Various shapes have been proposed for the subsequent annealed state, such as a teardrop configuration where the nicks are open and basepair stacking across nicks is disrupted, and a smooth state where basepair stacking is preserved across the nicks. Other non-canonical geometries may also be possible, such as a U-turn geometry, where the sticky overhangs interact via reverse Watson-Crick basepairing in a parallel stranded configuration. Conventional Watson-Crick base pairing between the overhangs, but in a geometry similar to the U-turn configuration, has been achieved in the past (hairpin loop). Presence of mismatches or transient defects in the duplex region may influence whether such a geometry is preferred over the teardrop or smooth configuration. Because all members of various libraries in our study used the same overhang sequences, relative differences between them in their looping kinetics are unlikely to be affected by a potential conformational heterogeneity of the looped state.
Extended Data Fig. 3:
Extended Data Fig. 3:. Controls pertaining to the loop-seq assay.
a, Seven of the ten sequences whose looping times were measured using smFRET (Sequences 1 – 7 in Fig. 1c, and listed in Supplementary Note 1) were combined to form a small library. Plotted are the cyclizability values obtained by performing loop-seq on this library vs the looping times obtained from the individual smFRET experiments. Pearson’s r = −0.977. 95% CI = [−0.848 −0.997]. p = 0.0002 (t-test, two-sided). This plot is similar to that in Fig. 1f, except that loop-seq was performed on a much smaller library comprising only these seven sequences. This control serves to revalidate the anti-correlative relation between looping time and cyclizability and confirm reproducibility of loop-seq measurements. b, Regular PCR of the entire library containing multiple templates can generate incorrectly annealed products that are annealed via the 25 bp identical adapters at the ends but have 50 bp bubbles in the middle. Such constructs would likely be extremely flexible and would be protected from digestion due to rapid cyclization. Emulsion PCR separates the templates in individual droplets, thereby preventing incorrect annealing between different templates. We performed a control experiment to verify that emulsion PCR of the library does not affect the measured value of cyclizability. In one case, seven template sequences (Sequences 1 – 7 as listed in supplementary note 1 and Fig. 1c) were mixed and then a single round of ePCR was performed to form a small library. In another case, seven separate regular PCR amplifications were carried out for the seven template molecules. The amplified products were then mixed in equimolar proportions to form the library. Loop-seq was performed on these two 7 member libraries and two sets of cyclizabilities of the seven sequences were measured. As indicated by the plot here, these values are highly correlated. Pearson’s r = 0.992. 95% CI = [0.951, 0.999]. p = 8E-6 (t-test, two-sided). c, Technical replicates of loop-seq performed on the Cerevisiae Nucleosomal Library (Supplementary Note 4). Pearson’s r = 0.869 95% CI = [0.865, 0.872]. p < 0.00001 (calculated using two-sided t-test). p value was obtained from a two-sided t-test. The same original library as provided by the manufacturer (Genscript) was used, but all subsequent steps were performed independently. d, This control was performed to verify the expectation that DNA sequences which are more bendable and thus loop quickly under high salt conditions are slow to unloop under low salt conditions. In red is the looping kinetics of Sequence 6, and in green is that of Sequence 7 (Fig. 1c) measured using smFRET. For the unlooping measurements, the slide containing nicked DNA was incubated for 2 hours with high salt imaging buffer containing 1M NaCl. After that, low salt imaging buffer containing no added NaCl was flowed in and the percentage of molecules in high FRET as a function of time was measured. e, This control was performed to verify that while performing loop-seq, molecules do not significantly unloop during the 20 minutes of digestion with RecBCD (see methods). If they do unloop, they would be immediately digested, which would affect the measurement of cyclizability. Sequence 5 (Supplementary Note 1) was used in this experiment. We find that digestion buffer (without the RecBCD enzyme) is in itself capable of looping molecules, and that too at a slightly faster rate than in the presence of looping buffer (which has 1M NaCl, see methods). This is owing to the presence of Mg2+ ions in the digestion buffer, which we know to also effect looping by allowing stable hybridization of the ends. Thus, molecules looped in the presence of 1M NaCl in 1 minute are expected to stay looped during the subsequent 20 minutes of digestion with RecBCD. f, This control was performed to understand the effect of the orientation of the central 50 bp sequence on the measured value of cyclizability. First the Random Library was constructed (where the sequence of DNA in the 50 bp central variable region are randomly selected – see Supplementary Note 5). Then a library called “Mixed Reverse Complement of the Random Library and Random Library” was constructed by mixing the Random Library with another library, where every sequences in the Random Library was present, but had its central 50 bp variable region flipped (Supplementary Note 6). Loop-seq was performed on this new library. We found that the cyclizability of a sequence represented in the Random Library half of this new library was correlated with that of the corresponding sequence in the other half, where the central 50 bp region was flipped. Pearson’s r = 0.73. 95% CI = [0.72, 0.74]. p < 0.00001, calculated using t-test (two-sided). g, This plot serves to confirm the expectations that RecBCD does not digest looped molecules, and that over sufficient time, most molecules, even rigid ones, will loop. During timecourse loop-seq (Supplementary Note 3), the original sample was split into 8 identical fractions. Looping for various amounts of time and subsequent digestion was carried out for 7 of the 8 fractions, while one fraction was not subject to any digestion. All fractions were then PCR amplified (16 cycles) under identical conditions (see methods). Plotted are the concentrations of DNA obtained after PCR vs the corresponding times the samples were subject to the looping condition. These data points were fit to an exponential curve (solid line). The [DNA] obtained when no digestion was performed is represented as the dashed horizontal line. The fact that the fitted exponential approaches the dashed line indicates that for very long looping times, almost all molecules, even very rigid ones, have had sufficient time to loop and hence are protected from digestion. Thus, in this case, the concentration of DNA obtained after PCR of all surviving molecules approaches that of the fraction where no digestion was performed at all. Whether RecBCD would digest molecules sealed via non-canonical parallel basepairing or other geometries (Extended Data Fig. 1) is not known. However, this control suggests that either it does not, or such unconventional basepairings are rare. h, These plots serve to further demonstrate that if the library is permitted to loop for a very long time, most molecules, even very rigid ones, will loop, and that looped molecules are protected from subsequent digestion with RecBCD. In this case, the relative populations of various sequences measured after digestion should be similar to the case where no digestion was performed at all. We find this to indeed be the case: there is good correlation between the relative population of a sequence in sample 1 of timecourse loop-seq (no digestion) and sample 8 (2 hours of looping followed by digestion). However, correlation between the relative population of a sequence in sample 1 and that of a sequence in sample 2 (1 minute of looping prior to digestion) is much weaker. This is because in sample 2, only those molecules whose sequences render them bendable enough to loop under 1 minute are protected from subsequent RecBCD digestion. i, Technical replicates of the looping kinetic curve of sequence number 3 (Supplementary Note 1) measured using smFRET.
Extended Data Fig. 4:
Extended Data Fig. 4:. Dependence of cyclizability on tether geometry and rotational phasing.
a, Loop-seq was performed on the Random Library (Supplementary Note 5). Plotted is the mean A/T content as a function of position along the central variable 50 bp region, where the mean is calculated by averaging over the 1,000 most cyclizable (red) or least cyclizable (blue) sequences. The value of n (n = 26) is the distance in nucleotides of the biotin tether from the end of each molecule in the library (Fig. 1d). b, In an untethered geometry, sequence features such as the phase of the oscillations in A/T content may result in the looped configuration having a preference for a certain poloidal angle (rotation along the long axis of DNA). Preference for a certain poloidal angle translates to a preference for a certain orientation of the biotin-streptavidin tether. Shown above are two extreme cases – in one case, the poloidal angle preference of the sequence results in a preferential orientation of the tether on the outside, while in the other case, the tether points to the inside at the point of contact with the DNA. As the biotin-streptavidin-complex is quite large, the outside orientation may be more favored for looping owing to steric considerations. The outside orientation can be converted to the inside orientation by moving the biotin tether point to a base that is half the DNA helical repeat away. This may explain why the phase of the oscillation of A/T content among the most or least cyclizable sequences shifts by half the helical repeat of DNA when the tether point is also shifted by half the helical repeat of DNA (about 5 bases) (compare panels a and c). c, The random library was re-prepared, placing the biotin 31 nt away from the ends (n = 31) and loop-seq was performed. Plotted are the same quantities as in panel a, except the 1,000 most and least cyclizable sequences of the library were identified based on the newly obtained cyclizability values under the n = 31 condition. d – e, See context in which these panels are referred to in Supplementary Note 7. In panel e, Pearson’s r = 0.987, 95% CI = [0.986, 0.987]. p < 0.00001 (t-test, two-sided). f, Mean A/T content as a function of position along the variable region of the Random Library, where the averaging is done over the 1,000 sequences that have the highest (red) or lowest (blue) values of intrinsic cyclizability. The scale of the axes is the same as in panels a and c. g, Amplitude spectra obtained from the fast Fourier transforms of the plots in panel f (solid lines) and panel a (dashed lines). h, 2D histogram of the scatter plot of measured cyclizability of sequences in the Random Library prepared with the biotin at the very end of the molecule (n = 0 condition) vs its predicted value based on the oscillatory model (equation 1 in Supplementary Note 7). Pearson’s r = 0.787. 95% CI = [0.78, 0.793]. p < 0.00001 (t-test, two-sided). i – k, 2D histogram of scatter plot of measured cyclizabilities of sequences in the Random library prepared at n = 0 vs prepared at n = 26, 29, 31 nt. For panel i, Pearson’s r = 0.38. 95% CI = [0.37, 0.40]. p < 0.00001 (t-test, two-sided). For panel j, Pearson’s r = 0.70. 95% CI = [0.69, 0.71]. p < 0.00001 (t-test, two-sided). For panel k, Pearson’s r = 0.77. 95% CI = [0.76, 0.78]. p < 0.00001 (t-test, two-sided). l, The use of long 10 nt overhangs has been shown to eliminate the need for ligase and to reduce the dependence of looping on rotational phasing between the ends. Shown here is a 2D histogram of the scatter plot of intrinsic cyclizability of a sequence in the random library (which had 50 bp of DNA along the central variable region) vs the corresponding sequence in library L (Supplementary Note 8) where 5 bases were added to the variable region. A correlation coefficient only slightly poorer than the correlation between cyclizability values of the Random Library and the Reverse Complement of the Random Library (Extended Data Fig. 3f) suggests that rotational phasing of the ends does not significantly influence intrinsic cyclizability. Pearson’s r = 0.63. 95% CI = [0.61, 0.65]. p < 0.00001 (t-test, two-sided). See Supplementary Note 8.
Extended Data Fig. 5:
Extended Data Fig. 5:. Intrinsic cyclizability and nucleosome occupancy vs position from the dyads of the +1 nucleosomes of various individual genes in S. cerevisiae.
Plots are as shown for the two individual genes in Fig. 2c. The dashed line marks the ordinate value where intrinsic cyclizability is lowest.
Extended Data Fig. 6:
Extended Data Fig. 6:. DNA pair selection and timecourse of INO80 remodeling.
a, Intrinsic cyclizability as a function of position from the +1 nucleosomal dyad, along the 334th, 75th, and 301st genes in the list of 576 genes along which intrinsic cyclizability was measured (Supplementary Note 9). The 80 bp linker regions of both constructs (with rigid and flexible linkers extending from the 601 sequence) in pairs 1, 2, and 3 along which INO80 sliding extent was measured, were selected from the 334th, 75th, and 301st genes respectively. Genes are oriented in the upstream to downstream direction. Red and green denote the selected 80 bp less rigid and more rigid linker regions respectively. See Supplementary Note 11. b, The remodeling reactions shown in Fig. 2f were all performed for 1 minute of remodeling, under various enzyme concentrations. Here we show, instead, remodeling reaction timecourses at saturating [INO80] for all three pairs. Conditions are identical to that used in Fig. 2f, except that saturating (30 nM) INO80 is used and remodeling is permitted to progress for various amounts of time after addition of ATP (see methods). In all cases, the two constructs in a pair are present simultaneously, and distinguished by imaging the gel once for Cy3 and once for Cy5 fluorescence. Thus, although the sliding extent can be very sensitive to sliding time (especially for short sliding times), robust comparisons of sliding extents can be made between the two constructs in a pair. Quantification is done as explained in supplementary note 11. Sliding on the constructs in pair 1 was repeated in a separate experiment in presence of 30 nM enzyme, and also side by side in presence of 60 nM enzyme. Near identical extents of sliding indicate saturation has been reached at 30 nM INO80. c, Ratio of the fractional sliding extent in the construct formed on the more flexible linker to that formed on the more rigid linker, at various timepoints since addition of ATP, and in presence of 30 nM INO80. The dashed line indicates a ratio of 1. The ratio is computed from the data in panel b. The extent of sliding under saturating enzyme conditions is consistently higher for the construct involving more flexible linker (except, as expected, when the sliding extent approaches 100%). Solid lines connect observations that were made from the same initial reaction volume by sampling its fractions at various timepoints.
Extended Data Fig. 7:
Extended Data Fig. 7:. Control experiments pertaining to the INO80-mediated sliding of nucleosomes as reported in Fig. 2f.
a, In order to assess our confidence in the result that INO80 mediated sliding is greater in the construct with the less rigid linker, we performed nucleosome sliding experiments similar to those reported in Fig. 2f five times for each pair in the presence of 9 nM INO80 and for 2.5 mins of sliding. These constitute technical replicates. The products of sliding were analyzed on a 6% TBE gel as done in Fig. 2f. Each gel was imaged separately for Cy3 and for Cy5 fluorescence and quantified to calculate the fold-difference in sliding extent between the flexible and the rigid construct in each pair. The measurements of fold-differences for each pair are displayed in the box plots, along with the actual data points. The central mark in each box (red) represents the median and the bottom and edges represent the 25th and 75th percentile respectively. The whiskers extent to the most extreme datapoints. Also indicated are the mean, standard deviation (s.d.), and the upper limit of the p value (defined here as the probability of obtaining a fold-difference of 1 if the distribution of fold differences has the same mean and s.d. as that of these 5 measurements) as obtained by the application of Chebyshev’s inequality. Dashed line represents a fold-difference of 1. b, In the experiment described in Fig. 2f, the more rigid construct in all pairs was labeled with Cy3, while the less rigid construct was labeled with Cy5. This control verifies that the result that sliding extent in greater in the less rigid construct is not influenced by different dye properties. We swapped the dyes between the two constructs in pair 1. We then performed nucleosome sliding experiments on this modified pair 1 constructs for the three INO80 concentrations that yielded detectible sliding in Fig. 2f (6, 9, 13 nM), and for 1 minute of sliding as done in Fig. 2f. The products of sliding were analyzed on a 6% TBE gel, and the sliding extents quantified as done in Fig. 2f. We indeed find that even when the dyes are swapped, sliding extent is greater for nucleosomes formed on the less rigid construct. c, To obtain better statistics of sliding along the dye-swapped pair 1 constructs, we repeated one of the conditions in panel b (13 nM INO80, 1 min of sliding) five times. These constitute technical replicates. The measured fold-difference values are displayed in the box plot. The central mark in each box (red) represents the median and the bottom and edges represent the 25th and 75th percentile respectively. The whiskers extent to the most extreme datapoints. Also indicated are the mean, standard deviation (s.d.), and the upper limit of the p value (defined as in panel a) as obtained by the application of Chebyshev’s inequality. Dashed line represents a fold-difference of 1.
Extended Data Fig. 8:
Extended Data Fig. 8:. Intrinsic cyclizability along S. cerevisiae chromosome V at 7 bp resolution.
Data was obtained by performing loop-seq on the ChrV Library (Supplementary note 12).
Extended Data Fig. 9:
Extended Data Fig. 9:. Intrinsic cyclizability along nucleosomes.
a, Distribution of Nucleosome Center Positioning (NCP) scores of all 3,192 S. cerevisiae chromosome V nucleosomes. Quartile and octile boundaries of the distribution are shown in dashed lines and numbered (1 through 4 for quartiles and 1 through 8 for octiles). b, Mean intrinsic cyclizability of DNA as a function of position from the dyads of nucleosomes along yeast chromosome V, averaged over nucleosomes in each octile indicated in panel a. Error extents (shaded background) are s.e.m. c, Mean intrinsic cyclizability as a function of position, averaged over all 3,192 S. cerevisiae chromosome V nucleosomes (solid line). Height of the shaded region is the standard deviation of measurements. d, Scatter plot of the NCP scores of the 227 +1 nucleosomes of the 227 genes identified along chromosome V vs the mean intrinsic cyclizabilities of DNA along the 147 bp that span these nucleosomes. Intrinsic cyclizability values were obtained by performing loop-seq on the ChrV Library. Pearson’s r = 0.28. 95% CI = [0.15, 0.39]. p = 2.6E-5 (t-test, two-sided). e, Plot of intrinsic cyclizability as a function of position along all the 576 genes in the Tiling Library (red), and among 34% of these genes that had the highest (black) or lowest (green) NCP score value of the gene’s +1 nucleosome. Plots were obtained in a manner identical to that in Fig. 2b. Intrinsic cyclizability on either side of the dyad of +1 nucleosomes of genes that have high +1 nucleosome NCP score (black) is asymmetric, being higher on the TSS proximal (i.e. “left”) side of the dyad.
Extended Data Fig. 10:
Extended Data Fig. 10:. Loop-seq measurements compared to expectations based on earlier measurements and models (see Supplementary Note 16).
a, Two sets of 1,000 plots each of A/T content as a function of position along the central 50 bp variable region of 1,000 sequences in the Random Library with the highest and lowest values of intrinsic cyclizability were generated. Fast Fourier transforms of these two sets of 1,000 plots were taken individually and used to calculate a total of 2,000 amplitude spectra. Plotted is the mean of the 1,000 amplitude spectra for the 1,000 sequences that have the highest (red) or lowest (blue) intrinsic cyclizability values. The plot indicates that sequences that have very high or low intrinsic cyclizabilities also tend to be characterized by enhanced or suppressed periodic modulations in AT content respectively at the DNA helical repeat. b, We consider a poly dA:dT stretch which has at least l consecutive A or T nucleotides to be a poly dA:dT tract. For various values of l, plotted are intrinsic cyclizability (top panel), poly dA:dT tract content (middle panel), and nucleosome occupancy (bottom panel) vs position from the dyad of the +1 nucleosome, averaged over all 227 identified genes in S. cerevisiae chromosome V (blue) or over a selected subset of genes that show no peak in poly dA:dT content at the NDR (red; 30%, 62%, and 86% of genes for l = 4 bp, 7 bp, 10 bp respectively). See supplementary note 15 for plotting details, including how poly dA:dT content is defined. c, 2D histogram of the scatter plot between the number of poly dA:dT tracts in the 50 bp variable region and intrinsic cyclizability of sequences in the ChrV library. Any stretch of l or more consecutive As or Ts (here l = 4) is considered a poly dA:dT tract. Thus a sequence with one stretch of 5 As, and no other As or Ts, in the 50 bp variable region has 2 poly dA:dT tracts if l is considered to be 4. The scatter plot indicates that the overall correlation between intrinsic cyclizability and poly dA:dT content is very poor. Only non-overlapping sequences in the chrV library were considered. Pearson’s r = −0.07. 95% CI = [−0.09, −0.05]. p < 0.00001 (t-test, two-sided). d, Binned histogram of the data in panel c (which represents the l = 4 bp case), as well as for more restrictive definitions of poly dA:dT stretches (l = 5 to 10 bp). The y-axis values are the mean intrinsic cyclizabilities of those sequences in the ChrV library which contain the number of poly dA:dT tracts in the central 50 bp variable region as specified along the x-axis. Error bars are s.e.m. For l = 4, there were N = 5081, 2801, 1705, 948, 521, 268, 170 non-overlapping sequences in the ChrV library which had a poly dA:dT content of 0, 1, 2, 3, 4, 5, 6 respectively. For l = 5, the corresponding N values were 8771, 1594, 695, 323, 150, 89, 36. For l = 6, the corresponding N values were 10,523, 655, 295, 117, 62, 25, 32. For l = 7, the corresponding N values were 11,203, 290, 109, 56, 27, 30, 16. For l = 8, the corresponding N values were 11,497, 109, 58, 23, 30, 17, 11. For l = 9, the corresponding N values were 11,608, 57, 23, 29, 19, 13, 7. Because the count of poly dA:dT = 6 is less than 10, this point was left out from the plot. For l = 10, the corresponding N values were 11,665, 23, 30, 21, 11, 6, 1. Again, the intrinsic cyclizability for poly dA:dT content = 5, 6 were left out from the plot because the n value for these were less than 10. e, 2D histogram of the scatter plot of mean GC content along the central 50 bp variable region of sequences in the ChrV library vs their intrinsic cyclizabilities. Pearson’s r = 0.026. 95% CI = [0.019, 0.033]. p < 0.00001 (t-test, two-sided). f, A plot of mean poly dA:dT content (l = 4) as a function of position around the dyads of gene body nucleosomes along chromosome V in S. cerevisiae. The points along the horizontal axis where the nucleosome categories (+1, +2, etc) are marked represent the dyads of the nucleosomes. Light shaded region represents s.e.m. Poly dA:dT content was calculated as described in Supplementary Note 15. g, Mean poly dA:dT content (l = 4) and AA/TT/TA vs position along the native and codon-randomized nucleosomal DNA sequences of the same 500 +7 nucleosomes along which intrinsic cyclizability profiles are reported in figure 3g. h, Predicted Plectoneme Density (PPD, see Supplementary Note 16) (top panel), intrinsic cyclizability (middle panel) and nucleosome occupancy (bottom panel) vs position from the +1 nucleosomal dyad, averaged over all 576 genes in the Tiling Library (supplementary note 9). PPD along each gene was first smoothened using a rolling window of 51 bp. The smoothened PPDs were then averaged at each position across all 576 genes, normalized by the mean, and plotted as the solid line in the top panel. The shaded background represents s.e.m. Intrinsic cyclizability and nucleosome occupancy were plotted exactly as in figure 2b. For the plot of intrinsic cyclizability, the solid blue and black lines represent data without and with a 7-fragment smoothening respectively (as in Fig. 2b). The shaded blue background represents s.e.m of intrinsic cyclizability values in the unsmoothed data.
Figure 1:
Figure 1:. A high-throughout method to measure DNA mechanics.
a, Schematic of the single-molecule looping assay. b, In situ nicking of 120 bp duplex DNA 10 nt from either end, following by washing with buffer at 50 °C, results in 100 bp duplex molecules flanked by 10 nt single-stranded overhangs (Supplementary Note 1). c, Percentage of DNA molecules in the high FRET (looped) state as a function of time after adding high salt, for 10 DNA sequences (Supplementary Note 1). Looping times (time constants of exponential decay fits (solid lines)) are displayed in the bar plot. (d) Schematic of a typical DNA fragment in a library just prior to looping. ‘n’ is the distance in nucleotides of the biotin tether from the end of the molecule (e) Schematic of loop-seq performed on a hypothetical library comprising only two sequences – green and pink. The library is amplified, immobilized on beads via biotin-streptavidin interactions and nicked in situ to generate loopable molecules (Fig. 1d, Methods). After looping for 1 minute in high salt followed by the digestion of unlooped molecules and the amplification of surviving molecules, the relative populations of green and pink in the digested fraction are 2/3 and 1/3 respectively. The corresponding values in the control fraction are 1/2 and 1/2. The cyclizability of green and pink are thus loge2/31/2 and loge1/31/2 respectively. (f) Left panel: Cyclizabilities of 10 sequences (listed in Supplementary Note 1) which were part of the Cerevisiae Nucleosomal Library (Supplementary Note 4) vs their looping times obtained via smFRET (Fig. 1c). Pearson’s r = −0.96. 95% CI = [−0.99, −0.81]. p = 1.5E-5 (t-test, two-sided). Right panel: Violin plot of the cyclizabilities of all 19,907 sequences in the Cerevisiae Nucleosomal Library. Cyclizabilities of sequences whose looping times were measured via smFRET are indicated by ‘x’.
Figure 2:
Figure 2:. DNA mechanics contributes to nucleosome depletion at the NDR and modulates remodeler activities.
(a) Schematic of the Tiling Library. The regions around the TSSs of 576 genes were tiled at 7 bp resolution (Supplementary Note 9). DNA: blue bars, nucleosomes: ovals. (b) Mean intrinsic cyclizability (with (black) and without (grey) any smoothening) and nucleosome occupancy vs position from the canonical location of the dyad of the +1 nucleosome, averaged over all 576 genes in the Tiling Library. Blue dashed line (at −73 bp): edge of the +1 nucleosome, red dashed line: start of the rigid DNA region (approximated as the midpoint between the two red arrows). See supplementary note 10. (c) Intrinsic cyclizability and nucleosome occupancy vs position from the dyads of the +1 nucleosomes of two individual genes (see Extended Data Fig. 5 for more examples). (d) INO80 attempting to slide a +1 nucleosome upstream of its canonical location would be poised to contact the rigid DNA region via its Arp8,. (e) Schematic of the experiment comparing the extent of nucleosome sliding by INO80 on a pair of constructs comprising a nucleosome attached to a rigid or flexible linker region, and distinguished by different fluorophores. Sliding results in centered nucleosomes, which migrate more slowly. See Supplementary Note 11. (f) Three nucleosome sliding experiments were performed involving three pairs of nucleosome constructs as described in figure 2e and supplementary note 11. For each pair, four [INO80] were used (lanes L1 – L4): 2, 6, 9, and 13 nM. Post sliding, nucleosomes were run along a 6% TBE gel, which was imaged separately for Cy3 and Cy5 fluorescence. For each lane ([INO80]), the nucleosome sliding extents in the two constructs in the pair were quantified (bar plots). See figure 2e and supplementary note 11.
Figure 3.
Figure 3.. DNA mechanics impacts chromosome-wide nucleosome organization.
Mean intrinsic cyclizability and nucleosome occupancy vs position from the dyad of the +1 nucleosome, (a) averaged over 185 and 345 genes in the Tiling Library which possess stable and fragile −1 nucleosomes respectively (plotted as in Fig 2b), and (b) averaged over all 227 identified genes along S. cerevisiae chromosome V (Supplementary Note 13). Grey background: s.e.m. (c) Intrinsic cyclizability vs position from the dyad, averaged over all 3,192 nucleosomes along chromosome V (top), or over nucleosomes sorted into quartiles based on reported NCP scores (bottom). See also Extended Data Fig. 8a–b. Solid lines: mean, grey background: s.e.m. (d) Intrinsic cyclizability and nucleosome occupancy (solid lines) vs position along all chromosome V genes (blue), and among 34% of genes with the highest (red) and lowest (green) +1 nucleosome NCP scores. Plots were obtained as in panel b. Dashed lines: edges and dyad of the +1 nucleosome. Shaded backgrounds: s.e.m. (e) Intrinsic cyclizability around nucleosomal dyads that lie within the transcribed region of all identified 227 genes along chromosome V in S. cerevisiae. Solid lines: mean, grey background: s.e.m. See supplementary Note 13 for N values. (f) Mean and s.e.m. of intrinsic cyclizability in a 50 bp window around the dyads (blue) and edges (red, from position −73 till −56 and from +56 till +73) of gene-body nucleosomes. N values are the same as in panel e. (g) Intrinsic cyclizability of the native sequences around the dyads of the 500 +7 nucleosomes represented in Library L (supplementary note 14), and along four sets of codon-altered sequences generated by randomly selecting synonymous codons while considering (the first two) or not considering (the next two) the natural codon-usage frequency. Solid line: mean, smoothened over a 7-fragment rolling window, grey background: s.e.m.
Figure 4:
Figure 4:. TSS-proximal nucleosomes are asymmetric.
(a) Black: Intrinsic cyclizability as a function of position along 601 DNA (supplementary note 18) Blue: abscissa value of the solid horizontal line (−0.196) is the mean intrinsic cyclizability along the 500 native +7 nucleosomal sequences represented in Library L (supplementary note 14). The height of the light blue background (0.011) is twice the s.e.m. (b) Schematic representing the design of the Cerevisiae Nucleosomal Library (supplementary note 4). The library contains DNA fragments taken from the 50 bp immediately to the left and right of the dyads of the ~10,000 nucleosomes in S. cerevisiae that have the highest NCP scores. (c) Mean intrinsic cyclizabilities of the 50 bp DNA fragments that lie immediately adjacent to the TSS proximal (red) or distal (blue) side of the dyads of various categories of nucleosomes (−9 through +9) (see supplementary note 17). Error bars are s.e.m. See supplementary Note 17 for the number of nucleosomes in each category over which data was averaged. (d) A subset of the data as in panel c, where the means were calculated considering only genes among the 10% most (left panel) or least (middle panel) expressed in S. cerevisiae. The right panel is identical to panel b, except for an altered y-axis scale. Error bars are s.e.m. See supplementary note 17.

Comment in

References

    1. Garcia HG et al. Biological consequences of tightly bent DNA: The other life of a macromolecular celebrity. Biopolymers 85, 115–130 (2007). - PMC - PubMed
    1. Krietenstein N et al. Genomic Nucleosome Organization Reconstituted with Pure Proteins. Cell 167, 709–721.e12 (2016). - PMC - PubMed
    1. Shore D, Langowski J & Baldwin RL DNA flexibility studied by covalent closure of short fragments into circles. PNAS 78, 4833–4837 (1981). - PMC - PubMed
    1. Cloutier TE & Widom J Spontaneous sharp bending of double-stranded DNA. Mol. Cell 14, 355–362 (2004). - PubMed
    1. Ha T et al. Probing the interaction between two single molecules: fluorescence resonance energy transfer between a single donor and a single acceptor. Proc. Natl. Acad. Sci. U.S.A 93, 6264–6268 (1996). - PMC - PubMed

Publication types