Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 28;16(1):33.
doi: 10.3390/genes16010033.

Development of a Polygenic Risk Score for Metabolic Dysfunction-Associated Steatotic Liver Disease Prediction in UK Biobank

Affiliations

Development of a Polygenic Risk Score for Metabolic Dysfunction-Associated Steatotic Liver Disease Prediction in UK Biobank

Panagiota Giardoglou et al. Genes (Basel). .

Abstract

Background: Metabolic dysfunction-associated steatotic liver disease (MASLD) is the leading cause of liver-related morbidity and mortality. Although the invasive liver biopsy remains the golden standard for MASLD diagnosis, Magnetic Resonance Imaging-derived Proton Density Fat Fraction (MRI-PDFF) is an accurate, non-invasive method for the assessment of treatment response. This study aimed at developing a Polygenic Risk Score (PRS) to improve MRI-PDFF prediction using UK Biobank data to assess an individual's genetic liability to MASLD.

Methods: We iteratively sequestered 10% of MRI-PDFF samples as a validation set and split the rest of each dataset into base and target partitions, containing GWAS summary statistics and raw genotype data, respectively. PRSice2 was deployed to derive PRS candidates. Based on the frequency of SNP appearances along the PRS candidates, we generated different SNP sets according to variable frequency cutoffs. By applying the PRSs to the validation set, we identified the optimal SNP set, which was then applied to a Greek nonalcoholic fatty liver disease (NAFLD) study.

Results: Data from 3553 UK Biobank participants yielded 49 different SNP sets. After calculating the PRS on the validation set for every SNP set, an optimal PRS with 75 SNPs was selected (incremental R2 = 0.025, p-value = 0.00145). Interestingly, 43 SNPs were successfully mapped to MASLD-related known genes. The selected PRS could predict traits, like LDL cholesterol and diastolic blood pressure in the UK Biobank, as also disease outcome in the Greek NAFLD study.

Conclusions: Our findings provide strong evidence that PRS is a powerful prediction model for MASLD, while it can also be applied on populations of different ethnicity.

Keywords: UK Biobank; metabolic dysfunction-associated steatotic liver disease; polygenic risk score.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Flow chart of UKBB participants who met the inclusion/exclusion criteria for the study.
Figure 2
Figure 2
After completion of iterative and aggregation PRS derivation process, different sets of SNP contents were generated depending on frequency cutoffs. Validation of the 68 generated SNP sets resulted in 20 optimal SNP sets that were statistically significant and exhibited explanatory power for the MRI-PDFF validation dataset. X-axis: number of SNPs per SNP set; y-axis: incremental R2, color-coding denotes the p-value. For the graph illustration, ggplot2 (v 3.5.1) [28] was used.
Figure 3
Figure 3
A final 75 SNPs containing PRS was selected based on high predictive value and low SNP content. The selected PRS was also evaluated for its predictive ability of NASH score in the Greek NAFLD study. The model that was generated when the 75 SNP set of the optimal PRS was applied yielded notable metrics, including p-value = 0.009 and incremental R2 value = 0.003.
Figure 4
Figure 4
A final 75 SNPs containing PRS was selected based on high predictive value and low SNP content. The optimal PRS was applied to the whole set of UK Biobank samples and evaluated for its predicted ability of MRI-PDFF (p-value = 0.001 and incremental R2 value = 0.025).
Figure 5
Figure 5
Phenotypic variance of UKBB explained for MASLD-related markers by the optimal 75 SNP-containing PRS. The graph illustration was performed using the software GraphPad Prism version 5.03.
Figure 6
Figure 6
Enrichment analysis of the gene set in order to investigate molecular functions in which genes associated with the SNP set of the optimal PRS are involved.

References

    1. Rinella M.E., Lazarus J.V., Ratziu V., Francque S.M., Sanyal A.J., Kanwal F., Romero D., Abdelmalek M.F., Anstee Q.M., Arab J.P., et al. A Multisociety Delphi Consensus Statement on New Fatty Liver Disease Nomenclature. J. Hepatol. 2023;79:1542–1556. doi: 10.1016/j.jhep.2023.06.003. - DOI - PubMed
    1. Lazarus J.V., Colombo M., Cortez-Pinto H., Huang T.T.K., Miller V., Ninburg M., Schattenberg J.M., Seim L., Wong V.W.S., Zelber-Sagi S. NAFLD—Sounding the Alarm on a Silent Epidemic. Nat. Rev. Gastroenterol. Hepatol. 2020;17:377–379. doi: 10.1038/s41575-020-0315-7. - DOI - PubMed
    1. Targher G., Byrne C.D., Tilg H. NAFLD and Increased Risk of Cardiovascular Disease: Clinical Associations, Pathophysiological Mechanisms and Pharmacological Implications. Gut. 2020;69:1691–1705. doi: 10.1136/gutjnl-2020-320622. - DOI - PubMed
    1. Targher G., Corey K.E., Byrne C.D., Roden M. The Complex Link between NAFLD and Type 2 Diabetes Mellitus—Mechanisms and Treatments. Nat. Rev. Gastroenterol. Hepatol. 2021;18:599–612. doi: 10.1038/s41575-021-00448-y. - DOI - PubMed
    1. Guo Z., Wu Q., Xie P., Wang J., Lv W. Immunomodulation in Non-Alcoholic Fatty Liver Disease: Exploring Mechanisms and Applications. Front. Immunol. 2024;15:1336493. doi: 10.3389/fimmu.2024.1336493. - DOI - PMC - PubMed

LinkOut - more resources