Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Sep 28;14(1):120.
doi: 10.1186/s13148-022-01342-3.

Interpretation of the role of germline and somatic non-coding mutations in cancer: expression and chromatin conformation informed analysis

Affiliations
Review

Interpretation of the role of germline and somatic non-coding mutations in cancer: expression and chromatin conformation informed analysis

Michael Pudjihartono et al. Clin Epigenetics. .

Abstract

Background: There has been extensive scrutiny of cancer driving mutations within the exome (especially amino acid altering mutations) as these are more likely to have a clear impact on protein functions, and thus on cell biology. However, this has come at the neglect of systematic identification of regulatory (non-coding) variants, which have recently been identified as putative somatic drivers and key germline risk factors for cancer development. Comprehensive understanding of non-coding mutations requires understanding their role in the disruption of regulatory elements, which then disrupt key biological functions such as gene expression.

Main body: We describe how advancements in sequencing technologies have led to the identification of a large number of non-coding mutations with uncharacterized biological significance. We summarize the strategies that have been developed to interpret and prioritize the biological mechanisms impacted by non-coding mutations, focusing on recent annotation of cancer non-coding variants utilizing chromatin states, eQTLs, and chromatin conformation data.

Conclusion: We believe that a better understanding of how to apply different regulatory data types into the study of non-coding mutations will enhance the discovery of novel mechanisms driving cancer.

Keywords: Cancer; Chromosome conformation; GWAS; Germline mutation; Hi-C; Non-coding mutation; Somatic mutation; eQTL.

PubMed Disclaimer

Conflict of interest statement

All authors have seen and approved the final manuscript. They do not have any competing interests to declare.

Figures

Fig. 1
Fig. 1
An overview of cis-regulatory elements and the experimental methods to identify them
Fig. 2
Fig. 2
Methods to link enhancers to their target genes
Fig. 3
Fig. 3
Expression quantitative trait locus. A An expression QTL arises due to DNA variation (single nucleotide polymorphism; SNP) modifying the transcription level of target gene X in an allele-specific manner. B Association between a SNP’s genotype and the expression level of target gene X in hundreds of individuals
Fig. 4
Fig. 4
An overview of TWAS pipeline (PrediXcan [116]). The general method of TWAS is composed of three steps. First, using individual-level genotype and matching gene expression data from a reference eQTL dataset, predictive models are trained to estimate the expression level of each gene based on local genotype. Second, the models are used to predict (or “impute”) the expression level of genes (normally not captured in GWAS) for each individual-level genotype in a GWAS dataset. Third, an association test is conducted for each predicted expression with the trait to elucidate gene–trait associations. The first step has been improved by subsequent methods, which allow summary-level GWAS data as input (e.g., FUSION [117], S-PrediXcan [118], MOSTWAS [119], and UTMOST [120]). For example, UTMOST takes summary-level data and simultaneously trains models across multiple tissues to increase power [120]
Fig. 5
Fig. 5
The non-random packaging of chromatin inside the nucleus. A On the nuclear scale, each chromosome occupies individual regions, termed chromosomal territories [135]. B Within these chromosomes, chromatin clusters into transcriptionally active (“A”) and inactive (“B”) compartments [136]. C Within these compartments, further organization occurs in the form of megabase-long loop structures called “topologically associating domains” (TADs) [137]. TADs are highly conserved between cell types and tend to insulate enhancers and genes contained within it from elements outside of the TAD, thereby preventing inappropriate enhancer–promoter contacts [138]. D Finally, TADs are further compartmentalized into smaller sub-TAD loops that frequently facilitate enhancer–promoter interactions. Unlike TADs, these smaller loops are more cell-type specific [139]. Generally, it is thought that TAD and sub-TAD loops are formed by the interaction of CTCF DNA binding proteins and cohesin ring-shaped complexes that bring distant chromatin regions into physical proximity [140]. However, even this is a simplistic model as further evidence suggest the involvement of many other factors. For example, recent evidence suggest that sub-TAD loops are more commonly stabilized by YY1 proteins in a manner analogous to CTCF [141]. Another evidence shows the involvement of ZNF143 as a chromatin-looping factor that bind to promoter and establish loops through interaction with enhancer-bound CTCF and cohesin [142]. Overall, the mechanisms behind chromatin looping are still an active area of research
Fig. 6
Fig. 6
Illustration of two different recurrence model. A Site-level recurrence model: three point mutations from three individual tumor samples are clustered on the same site, making it a hotspot of somatic mutations. B Gene-level recurrence model: three point mutations from three individual tumor samples are scattered on three different regulatory elements, but when spatial conformation is taken into account, those mutations converge on the same target gene. Figure is adapted from [175]
Fig. 7
Fig. 7
Many trans-eQTLs are also cis-eQTLs. A SNP may first alter the expression level of a cis diffusible mediator, such as a transcription factor, which then alters the expression of the trans genes through altering the binding of the transcription factor to its binding sites near the downstream trans genes

Similar articles

Cited by

References

    1. Rubin CM. The genetic basis of human cancer. Ann Intern Med. 1998;129(9):759. doi: 10.7326/0003-4819-129-9-199811010-00045. - DOI
    1. Martincorena I, Campbell PJ. Somatic mutation in cancer and normal cells. Science (1979) 2015;349(6255):1483–1489. doi: 10.1126/science.aab4082. - DOI - PubMed
    1. Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–674. doi: 10.1016/j.cell.2011.02.013. - DOI - PubMed
    1. Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA, Kinzler KW. Cancer genome landscapes. Science (1979) 2013;340(6127):1546–1558. doi: 10.1126/science.1235122. - DOI - PMC - PubMed
    1. Hall JM, Lee MK, Newman B, et al. Linkage of early-onset familial breast cancer to chromosome 17q21. Science (1979) 1990;250(4988):1684–1689. doi: 10.1126/science.2270482. - DOI - PubMed

Publication types