Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 24;24(1):64.
doi: 10.1186/s12859-023-05170-x.

Does multi-way, long-range chromatin contact data advance 3D genome reconstruction?

Affiliations

Does multi-way, long-range chromatin contact data advance 3D genome reconstruction?

Adam B Olshen et al. BMC Bioinformatics. .

Abstract

Background: Methods for inferring the three-dimensional (3D) configuration of chromatin from conformation capture assays that provide strictly pairwise interactions, notably Hi-C, utilize the attendant contact matrix as input. More recent assays, in particular split-pool recognition of interactions by tag extension (SPRITE), capture multi-way interactions instead of solely pairwise contacts. These assays yield contacts that straddle appreciably greater genomic distances than Hi-C, in addition to instances of exceptionally high-order chromatin interaction. Such attributes are anticipated to be consequential with respect to 3D genome reconstruction, a task yet to be undertaken with multi-way contact data. However, performing such 3D reconstruction using distance-based reconstruction techniques requires framing multi-way contacts as (pairwise) distances. Comparing approaches for so doing, and assessing the resultant impact of long-range and multi-way contacts, are the objectives of this study.

Results: We obtained 3D reconstructions via multi-dimensional scaling under a variety of weighting schemes for mapping SPRITE multi-way contacts to pairwise distances. Resultant configurations were compared following Procrustes alignment and relationships were assessed between associated Procrustes root mean square errors and key features such as the extent of multi-way and/or long-range contacts. We found that these features had surprisingly limited influence on 3D reconstruction, a finding we attribute to their influence being diminished by the preponderance of pairwise contacts.

Conclusion: Distance-based 3D genome reconstruction using SPRITE multi-way contact data is not appreciably affected by the weighting scheme used to convert multi-way interactions to pairwise distances.

Keywords: Conformation capture; Multi-dimensional scaling; Pairwise distance; Procrustes alignment; SPRITE.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Contact frequency by maximum distance between reads for standard Hi-C [3] and SPRITE stratified by k-mer size
Fig. 2
Fig. 2
Barplots of a k-mer sizes (green > 3, red 3, black 2) and b distances (quintiles with first quintile black and fifth quintile cyan) of k-mers by chromosome. The first quintile ranged from 18 to 27%, the second from 17 to 26%, the third from 18 to 24%, the fourth from 17 to 24%, and the fifth from 5 to 27%
Fig. 3
Fig. 3
Procrustes RMSEs as a function of length, percentage many-contacts and percentage long-distance contacts. Least squares fits relate these features to RMSEs obtained using the N-W 3D reconstruction as target, with black and green designating aligned U-W and O-W 3D reconstructions, respectively
Fig. 4
Fig. 4
Procrustes RMSEs for a standard Hi-C reconstruction target compared to SPRITE reconstructions using U-W (black U), N-W (red N), and O-W (green O)
Fig. 5
Fig. 5
Heatmaps depicting contact matrices (of square root transformed counts) for the Hist1 region of Chromosome 6. Scales (not shown) differ according to assay type, weighting scheme and threshold (saturation) value. Darker pixels are for higher contact values, except white is for saturation. Regions of interest are defined by pairs of dark green, lime green and turquoise lines
Fig. 6
Fig. 6
Same as Fig. 5. but for the contact matrices for the three consecutive CTCF anchored loops on Chromosome 8

Similar articles

References

    1. Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschner MO, Sandstrom R, Bernstein B, Bender MA, Groudine M, Gnirke A, Stamatoyannopoulos J, Mirny LA, Lander ES, Dekker J. Comprehensive mapping of long-range contacts reveals folding principles of the human genome. Science. 2009;326:289–293. doi: 10.1126/science.1181369. - DOI - PMC - PubMed
    1. Duan Z, Andronescu M, Schutz K, McIlwain S, Kim YJ, Lee C, Shendure J, Fields S, Blau CA, Noble WS. A three-dimensional model of the yeast genome. Nature. 2010;465:363–367. doi: 10.1038/nature08973. - DOI - PMC - PubMed
    1. Rao SS, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, Sanborn AL, Machol I, Omer AD, Lander ES, Aiden EL. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. - DOI - PMC - PubMed
    1. Marco A, Meharena HS, Dileep V, Raju RM, Davila-Velderrain J, Zhang AL, Adaikkan C, Young JZ, Gao F, Kellis M, Tsai LH. Mapping the epigenomic and transcriptomic interplay during memory formation and recall in the hippocampal engram ensemble. Nat Neurosci. 2020;23:1606–1617. doi: 10.1038/s41593-020-00717-0. - DOI - PMC - PubMed
    1. Ay F, Bunnik EM, Varoquaux N, Bol SM, Prudhomme J, Vert JP, Noble WS, Le Roch KG. Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression. Genome Res. 2014;24:974–88. doi: 10.1101/gr.169417.113. - DOI - PMC - PubMed