Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 30;16(1):5043.
doi: 10.1038/s41467-025-60275-6.

Multi-tissue expression and splicing data prioritise anatomical subsite- and sex-specific colorectal cancer susceptibility genes

Affiliations

Multi-tissue expression and splicing data prioritise anatomical subsite- and sex-specific colorectal cancer susceptibility genes

Emma Hazelwood et al. Nat Commun. .

Abstract

Genome-wide association studies have suggested numerous colorectal cancer (CRC) susceptibility genes, but their causality and therapeutic potential remain unclear. To prioritise causal associations between gene expression/splicing and CRC risk (52,775 cases; 45,940 controls), we perform a transcriptome-wide association study (TWAS) across six tissues with Mendelian randomisation and colocalisation, integrating sex- and anatomical subsite-specific analyses. Here we reveal 37 genes with robust causal links to CRC risk, ten of which have not previously been reported by TWAS. Most likely causal genes with evidence of cancer cell dependency show elevated expression linked to risk, suggesting therapeutic potential. Notably, SEMA4D, encoding a protein targeted by an investigational CRC therapy, emerges as a key risk gene. We also identify a female-specific association with CRC risk for CCM2 expression and subsite-specific associations, including LAMC1 with rectal cancer risk. These findings offer valuable insights into CRC molecular mechanisms and support promising therapeutic avenues.

PubMed Disclaimer

Conflict of interest statement

Competing interests: T.G.R. is employed full-time by GlaxoSmithKline outside of the research presented in this manuscript. Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer/World Health Organization. This article is the result of the scientific work of Dr Murphy while he was affiliated at IARC. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Overview of multi-tissue TWAS, colocalization, and MR-based gene prioritisation for colorectal cancer risk.
a Flowchart showing analysis overview and number of genes/splicing events identified at each stage. “Genes with robust evidence” includes those that had H4 above 0.8 in colocalisation analyses, and which either passed Bonferroni correction in the relevant MR analysis (p < 4.38 × 10−5; 0.05/N*G where N is the number of gene-tissue pairs (161) and G is the number of CRC GWAS (7) for genes identified in TWAS analyses or p < 1.32 × 10−4; 0.05/number of druggable genes with suitable genetic instruments available (380) for genes identified as part of the druggable genome) or which did not have suitable instruments available to be included in the MR analysis. MR Mendelian randomisation. b Manhattan plot showing results of S-MultiXcan and JTI TWAS analyses of colorectal cancer risk, for all anatomical subsites combined. Where genes were identified in multiple TWAS analyses, the one with the lowest p value was retained. Genes labelled are those prioritised following subsequent analyses. All statistical tests were two-sided with the unadjusted p values from S-MultiXcan or JTI plotted. c Venn diagram showing overlap of final prioritised 37 genes identified by each TWAS analysis. JTI joint tissue imputation, eQTLs expression quantitative trait loci, sQTLs splice quantitative trait loci. Source data are provided as a Source Data file.
Fig. 2
Fig. 2. Forest plots of JTI effect sizes across colorectal cancer anatomical subsites and sex for anatomical subsite- and sex-specific genes identified by JTI TWAS analysis.
Relevant tissue-specific estimates from JTI for risk of each anatomical subsite are plotted with 95% confident intervals. (sample sizes were 52,775 cases, 45,940 controls for overall; for all anatomical subsites there were 43,099 controls; colon, 28,736 cases; proximal colon, 14,416 cases; distal colon, 12,879 cases; and rectal, 14,150 cases; female, 24,594 cases, 23,936 controls; male, 28,271 cases, 22,351 controls). Solid points indicate the Bonferroni p value threshold of p < 6.01 × 10−8 was met in the JTI analysis. Errors bars may be hidden by the point estimate where the standard deviation is small relative to effect estimates. A AAMP expression in adipose subcutaneous tissue; B AAMP expression in adipose visceral tissue; C AAMP expression in colon sigmoid tissue; D COLCA1 expression in colon transverse tissue; E EPM2AIP1 expression in adipose visceral tissue; F EPM2AIP1 expression in whole blood; G LAMC1 expression in whole blood; H MLH1 expression in adipose subcutaneous tissue; I MLH1 expression in adipose visceral tissue; J MLH1 expression in whole blood; K MLH1 expression in lymphocytes; L RP11-129K12.1 expression in adipose subcutaneous expression; M RP11-129K12.1 expression in colon transverse tissue; N CCM2 expression in whole blood. Source data are provided as a Source Data file.
Fig. 3
Fig. 3. Forest plots of mean Z-score estimates from S-MultiXcan across colorectal cancer anatomical subsites for anatomical subsite-specific genes identified by S-MultiXcan (expression or splicing) TWAS analysis.
Relevant estimates for risk of each anatomical subsite are plotted with 95% confident intervals (sample sizes were 52,775 cases, 45,940 controls for overall; for all anatomical subsites there were 43,099 controls; colon, 28,736 cases; proximal colon, 14,416 cases; distal colon, 12,879 cases; and rectal, 14,150 cases; female, 24,594 cases, 23,936 controls; male, 28,271 cases, 22,351 controls). Solid points indicate the Bonferroni p value threshold of p < 3.91 × 10−7 was met in S-MultiXcan eQTL analysis or p < 5.49 × 10−7 was met in S-MultiXcan sQTL analysis. Errors bars may be hidden by the point estimate where the standard deviation is small relative to Z-score scale. A ABCC2 expression; B MLH1 expression; C RP11-129K12.1 expression; D ARPC2 splicing; E GPATCH1 splicing. Source data are provided as a Source Data file.

References

    1. Morgan, E. et al. Global burden of colorectal cancer in 2020 and 2040: incidence and mortality estimates from GLOBOCAN. Gut72, 338–344 (2023). - PubMed
    1. Bull, C. J. et al. Adiposity, metabolites, and colorectal cancer risk: Mendelian randomization study. BMC Med. 18, 396 (2020). - PMC - PubMed
    1. Florensa, D. et al. Acetylsalicylic acid effect in colorectal cancer taking into account the role of tobacco, alcohol and excess weight. Int. J. Environ. Res. Public Health20, 4104 (2023). - PMC - PubMed
    1. Johnson, C. M. et al. Meta-analyses of colorectal cancer risk factors. Cancer Causes Control24, 1207–1222 (2013). - PMC - PubMed
    1. Lauby-Secretan, B. et al. Body fatness and cancer-viewpoint of the IARC Working Group. N. Engl. J. Med.375, 794–798 (2016). - PMC - PubMed

LinkOut - more resources