Methods Mol Biol. 2023:2624:19-42.
doi: 10.1007/978-1-0716-2962-8_3.

Predicting Chromatin Interactions from DNA Sequence Using DeepC

Ron Schwessinger. Methods Mol Biol. 2023.

Abstract

The 3D structure of the genome is central to understanding how disease-associated genetic variants in the noncoding genome regulate their target genes. Genome architecture spans large-scale structures that are determined by fine-grained regulatory elements, which makes it challenging to predict the effects of sequence and structural variants. Experimental approaches for mapping chromatin interactions remain costly and time-consuming, limiting their use for interrogating, at scale, the changes in chromatin architecture associated with genomic variation. Computational models for predicting chromatin interactions have either interpreted chromatin at coarse resolution or failed to capture the long-range dependencies of larger sequence contexts. To bridge this gap, we previously developed deepC, a deep neural network approach to predict chromatin interactions from DNA sequence at megabase scale. deepC employs dilated convolutional layers to cover a large sequence context while still interpreting the DNA sequence at single base pair resolution. Transfer learning from convolutional weights trained to predict a compendium of chromatin features across cell types allows deepC to predict cell type-specific chromatin interactions from DNA sequence alone. Here, we present a detailed workflow for predicting chromatin interactions with deepC. We detail the necessary data pre-processing steps, guide the reader through deepC model training, and demonstrate how to use trained models to predict chromatin interactions and the effect of sequence variation on genome architecture.
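
To illustrate why dilated convolutions can cover a large sequence context while still reading every base, the sketch below computes the receptive field of a stack of stride-1 dilated 1D convolutions. The kernel width of 3, the doubling dilation schedule, and the function names are illustrative assumptions, not deepC's published hyperparameters or code.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (in input positions) of stacked stride-1 dilated 1D convolutions.

    Each layer with dilation d widens the field by (kernel_size - 1) * d positions.
    """
    field = 1  # a single output position sees at least one input base
    for dilation in dilations:
        field += (kernel_size - 1) * dilation
    return field


def doubling_dilations(n_layers):
    """Hypothetical schedule: dilation doubles with every layer (1, 2, 4, ...)."""
    return [2 ** i for i in range(n_layers)]


if __name__ == "__main__":
    # Assumed kernel width of 3; the layer counts are arbitrary examples.
    for n_layers in (5, 10, 15):
        bases = receptive_field(3, doubling_dilations(n_layers))
        print(f"{n_layers} layers -> receptive field of {bases} bp")
```

Under these assumptions the receptive field grows exponentially with depth while the number of layers grows only linearly, and because every layer keeps a stride of 1, no base is skipped or pooled away.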

Keywords: Chromatin interactions; Deep neural networks; DeepC; Gene regulation; Genomic variation; Machine learning.
