Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 11;4(12):100700.
doi: 10.1016/j.xgen.2024.100700. Epub 2024 Dec 4.

Deciphering proteins in Alzheimer's disease: A new Mendelian randomization method integrated with AlphaFold3 for 3D structure prediction

Affiliations

Deciphering proteins in Alzheimer's disease: A new Mendelian randomization method integrated with AlphaFold3 for 3D structure prediction

Minhao Yao et al. Cell Genom. .

Abstract

Hidden confounding biases hinder identifying causal protein biomarkers for Alzheimer's disease in non-randomized studies. While Mendelian randomization (MR) can mitigate these biases using protein quantitative trait loci (pQTLs) as instrumental variables, some pQTLs violate core assumptions, leading to biased conclusions. To address this, we propose MR-SPI, a novel MR method that selects valid pQTL instruments using Leo Tolstoy's Anna Karenina principle and performs robust post-selection inference. Integrating MR-SPI with AlphaFold3, we developed a computational pipeline to identify causal protein biomarkers and predict 3D structural changes. Applied to genome-wide proteomics data from 54,306 UK Biobank participants and 455,258 subjects (71,880 cases and 383,378 controls) for a genome-wide association study of Alzheimer's disease, we identified seven proteins (TREM2, PILRB, PILRA, EPHA1, CD33, RET, and CD55) with structural alterations due to missense mutations. These findings offer insights into the etiology and potential drug targets for Alzheimer's disease.

Keywords: 3D structure; AlphaFold3; Alzheimer's disease; Mendelian randomization; causal inference; genome-wide association study; instrumental variable; missense variants; pQTL; proteomics.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests The authors declare no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Overview of the pipeline First, we apply MR-SPI for each protein to (1) select valid pQTL IVs under the plurality condition and (2) estimate the causal effect on the outcome of interest. Second, we perform the Bonferroni correction procedure for causal protein identification. Third, for each causal protein biomarker, we apply AlphaFold3 to predict the 3D structural alterations due to missense pQTL IVs selected by MR-SPI.
Figure 2
Figure 2
The MR-SPI framework First, MR-SPI selects relevant IVs with strong pQTL-protein associations. Second, each relevant IV provides a ratio estimate of the causal effect and then receives votes on itself to be valid from the other relevant IVs whose degrees of violation of (A2) and (A3) are small under this ratio estimate of causal effect. For example, by assuming pQTL 1 is valid, the slope of the line connecting pQTL 1 and the origin represents the ratio estimate of pQTL 1, and pQTLs 2 and 3 vote for pQTL 1 to be valid because they are close to that line, while pQTLs 4, 5, and 6 vote against it since they are far away from that line. Third, MR-SPI estimates the causal effect by fitting a zero-intercept ordinary least squares regression of pQTL-outcome associations on pQTL-protein associations and construct the robust CI using selected valid pQTL IVs in the maximum clique of the voting matrix, which encodes whether two pQTLs mutually vote for each other to be valid IVs.
Figure 3
Figure 3
Empirical performance of MR-SPI and the other competing MR methods in simulated data with a sample size of 5,000 (A) Boxplot of the percent bias in causal effect estimates. (B) Empirical coverage of 95% CIs. The black dashed line represents the nominal level (95%). (C) Average lengths of 95% CIs.
Figure 4
Figure 4
Causal effect estimates, GO analysis, and structural alteration prediction for putatively causal proteins (A) Volcano plot of associations of plasma proteins with AD using MR-SPI. The horizontal axis represents the estimated effect size (on the log odds ratio scale), and the vertical axis represents the log10(p value). Positive and negative associations are represented by green and red points, respectively. The size of a point is proportional to log10(p value). The blue dashed line represents the significance threshold using Bonferroni correction (p<5.48×105). (B) 3D Structural alterations of CD33 predicted by AlphaFold3 due to missense genetic variation of pQTL rs2455069. The ribbon representation of 3D structures of CD33 with arginine and glycine at position 69 are colored in blue and red, respectively. The amino acids at position 69 are displayed in stick representation, with arginine and glycine colored in green and yellow, respectively. The predicted template modeling yields a score of 0.6 for both structures, which suggests that AlphaFold3 provides good predictions for these two 3D structures. (C) Forest plot of significant associations of proteins with AD identified by MR-SPI. The error bars represent the 95% CIs of MR-SPI and other competing MR methods, with CIs clipped to vertical axis limits. (D) Bubble plot of GO analysis results using the 7 significant proteins detected by MR-SPI. The horizontal axis represents the Z score of the enriched GO term, and the vertical axis represents the log10(p value) after Bonferroni correction. Each point represents one enriched GO term. The blue dashed line represents the significance threshold (adjusted p<0.05 after Bonferroni correction).

References

    1. Self W.K., Holtzman D.M. Emerging diagnostics and therapeutics for Alzheimer disease. Nat. Med. 2023;29:2187–2199. - PubMed
    1. GBD 2019 Dementia Forecasting Collaborators Estimation of the global prevalence of dementia in 2019 and forecasted prevalence in 2050: an analysis for the Global Burden of Disease Study 2019. Lancet Public Health. 2022;7:e105–e125. - PMC - PubMed
    1. Van Dyck C.H., Swanson C.J., Aisen P., Bateman R.J., Chen C., Gee M., Kanekiyo M., Li D., Reyderman L., Cohen S., et al. Lecanemab in early Alzheimer’s disease. N. Engl. J. Med. 2023;388:9–21. - PubMed
    1. Sims J.R., Zimmer J.A., Evans C.D., Lu M., Ardayfio P., Sparks J., Wessels A.M., Shcherbinin S., Wang H., Monkul Nery E.S., et al. Donanemab in early symptomatic Alzheimer disease: the TRAILBLAZER-ALZ 2 randomized clinical trial. JAMA. 2023;330:512–527. - PMC - PubMed
    1. Hardy J.A., Higgins G.A. Alzheimer's disease: the amyloid cascade hypothesis. Science. 1992;256:184–185. - PubMed