Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 17;17(1):135.
doi: 10.1186/s12916-019-1364-z.

GWAS and enrichment analyses of non-alcoholic fatty liver disease identify new trait-associated genes and pathways across eMERGE Network

Affiliations

GWAS and enrichment analyses of non-alcoholic fatty liver disease identify new trait-associated genes and pathways across eMERGE Network

Bahram Namjou et al. BMC Med. .

Abstract

Background: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver illness with a genetically heterogeneous background that can be accompanied by considerable morbidity and attendant health care costs. The pathogenesis and progression of NAFLD is complex with many unanswered questions. We conducted genome-wide association studies (GWASs) using both adult and pediatric participants from the Electronic Medical Records and Genomics (eMERGE) Network to identify novel genetic contributors to this condition.

Methods: First, a natural language processing (NLP) algorithm was developed, tested, and deployed at each site to identify 1106 NAFLD cases and 8571 controls and histological data from liver tissue in 235 available participants. These include 1242 pediatric participants (396 cases, 846 controls). The algorithm included billing codes, text queries, laboratory values, and medication records. Next, GWASs were performed on NAFLD cases and controls and case-only analyses using histologic scores and liver function tests adjusting for age, sex, site, ancestry, PC, and body mass index (BMI).

Results: Consistent with previous results, a robust association was detected for the PNPLA3 gene cluster in participants with European ancestry. At the PNPLA3-SAMM50 region, three SNPs, rs738409, rs738408, and rs3747207, showed strongest association (best SNP rs738409 p = 1.70 × 10- 20). This effect was consistent in both pediatric (p = 9.92 × 10- 6) and adult (p = 9.73 × 10- 15) cohorts. Additionally, this variant was also associated with disease severity and NAFLD Activity Score (NAS) (p = 3.94 × 10- 8, beta = 0.85). PheWAS analysis link this locus to a spectrum of liver diseases beyond NAFLD with a novel negative correlation with gout (p = 1.09 × 10- 4). We also identified novel loci for NAFLD disease severity, including one novel locus for NAS score near IL17RA (rs5748926, p = 3.80 × 10- 8), and another near ZFP90-CDH1 for fibrosis (rs698718, p = 2.74 × 10- 11). Post-GWAS and gene-based analyses identified more than 300 genes that were used for functional and pathway enrichment analyses.

Conclusions: In summary, this study demonstrates clear confirmation of a previously described NAFLD risk locus and several novel associations. Further collaborative studies including an ethnically diverse population with well-characterized liver histologic features of NAFLD are needed to further validate the novel findings.

Keywords: Fatty liver; GWAS; Genetic polymorphism; NAFLD; PheWAS; Polygenic risk score.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
a, b Manhattan plot (a) and Q–Q plot (b) of genome-wide markers for NAFLD in European ancestry (1106 cases and 8571 controls). A total of 1106 cases of NAFLD and 8571 controls were analyzed after quality control. Logistic regression analysis was performed for 7,261,527 variants with MAF > 1% assuming an additive genetic model, adjusted for age, sex, BMI, genotyping site, and genetic ancestry (principal components 1 through 3). Results are plotted as –log10 p values on the y-axis by position in chromosome (x-axis) (NCBI build 37)
Fig. 2
Fig. 2
a–c LocusZoom plot of the associations signals in three previously known regions for NAFLD. a Confirmation at 22q13 for PNPLA3. SNP rs738409 is a missense variation (I148M) in PNPLA3 produced the best effect (p = 1.70 × 10− 20). b Detected signal at 19p12 (GATAD2A, NCAN, TM6SF2) region. The best marker in this study was rs56408111 (p = 5.26 × 10− 6). The linkage disequilibrium (LD) between rs56408111 and previously known SNP rs4808199 was r2 = 0.24, D’ = 0.74. c Detected signal at 8q24 (TRIB1) genetic region. The best marker in this study (rs2980888) is shown (see also Additional file 1: Table S2). Estimated recombination rates (from HapMap) are plotted in cyan to reflect the local LD structure. The SNPs surrounding the most significant variant are color-coded to reflect their LD with the index SNP (taken from pairwise r2 values from the HapMap CEU database, www.hapmap.org). Regional plots were generated using LocusZoom (http://csg.sph.umich.edu/locuszoom)
Fig. 3
Fig. 3
a Means and standard deviations of NAS and fibrosis score (0–12) stratified by genotype of rs738409 at PNPLA3 in 235 NAFLD cases. The results are plotted as the sum of NAS and fibrosis score (0–12) (y-axis) against the three genotypes of rs738409 C>G polymorphism (x-axis). The results are further sub-divided by age groups (pediatrics, adult, and all). Results for IL17RA (b) and ZFP90 (c) also are shown
Fig. 4
Fig. 4
a–d Regional association plots of best effects in case-only linear regression analyses for continuous traits of NAS score, fibrosis, and ALT liver enzyme, respectively. a The best observed effect near the IL17RA region for NAS score. b The most significant effects at 16q22 near ZFP90 gene for fibrosis. c The effect near FABP1 locus for fibrosis. d An effect at 2p22 near XDH for AST liver enzyme
Fig. 5
Fig. 5
NAFLD case-control gene-based results using MAGMA as a base and tissue-specific gene expression (GTEx v7 with 30 general tissue types) as a source produced specific enrichment in liver (see “Methods”). List of all MAGMA gene-based results (P < 0.05) is shown in Additional file 1: Table S5
Fig. 6
Fig. 6
a–d ROC graphical plot that illustrates the diagnostic ability of the binary classifier NAFLD (cases and controls) and NAS score (above and below 5) using weighted GRS score of ten previously published SNPs (GRS-10, see “Results”). The sensitivity and specificity and AUC measures for each plot are also shown. a ROC curve for NAFLD-1106 cases and 8571 controls. b ROC curve for NAS score (79 cases above NAS score ≥ 5 versus 156 controls with score < 5). c Adding SNP rs5748926 near IL17RA improved the ROC curves for NAS score (GRS_11); difference between areas 0.035 (SE = 0.012, p = 0.004). d Distribution of quantiles of weighted 10-SNP GRS in NAFLD (cases and controls) and NAS score (above and below 5); percentage of NAFLD risk increases by increasing GRS quantiles; for NAFLD (cases and controls) from 17% in Q1 to 36% in Q4 (OR = 2.16, 95% CI = 1.81–2.58, p < 0.0001); for NAS score above 5 (defined as case) from 10% in Q1 to 43% in Q4 (OR = 8.50, 95% CI 3.45–20.96). The weighted 10-SNP GRS was calculated by multiplying the sum of the number of risk alleles (0, 1, 2) with the allele-specific effect sizes (beta coefficients) obtained from previous publications (see “Methods”)

References

    1. Loomba R, Sanyal AJ. The global NAFLD epidemic. Nat Rev Gastroenterol Hepatol. 2013;10:686–690. doi: 10.1038/nrgastro.2013.171. - DOI - PubMed
    1. Masuoka HC, Chalasani N. Nonalcoholic fatty liver disease: an emerging threat to obese and diabetic individuals. Ann N Y Acad Sci. 2013;1281:106–122. doi: 10.1111/nyas.12016. - DOI - PMC - PubMed
    1. Sanyal AJ, Brunt EM, Kleiner DE, Kowdley KV, Chalasani N, Lavine JE, et al. Endpoints and clinical trial design for nonalcoholic steatohepatitis. Hepatol. 2011;54:344–353. doi: 10.1002/hep.24376. - DOI - PMC - PubMed
    1. Anstee QM, Targher G, Day CP. Progression of NAFLD to diabetes mellitus, cardiovascular disease or cirrhosis. Nat Rev Gastroenterol Hepatol. 2013;10:330–344. doi: 10.1038/nrgastro.2013.41. - DOI - PubMed
    1. Younes R, Bugianesi E. NASH in lean individuals. Semin Liver Dis. 2019;39:86–95. doi: 10.1055/s-0038-1677517. - DOI - PubMed

Publication types

MeSH terms