Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug 29;9(1):12541.
doi: 10.1038/s41598-019-48746-5.

Gene Expression Predicts Histological Severity and Reveals Distinct Molecular Profiles of Nonalcoholic Fatty Liver Disease

Affiliations

Gene Expression Predicts Histological Severity and Reveals Distinct Molecular Profiles of Nonalcoholic Fatty Liver Disease

Stephen A Hoang et al. Sci Rep. .

Abstract

The heterogeneity of biological processes driving the severity of nonalcoholic fatty liver disease (NAFLD) as reflected in the transcriptome and the relationship between the pathways involved are not well established. Well-defined associations between gene expression profiles and disease progression would benefit efforts to develop novel therapies and to understand disease heterogeneity. We analyzed hepatic gene expression in controls and a cohort with the full histological spectrum of NAFLD. Protein-protein interaction and gene set variation analysis revealed distinct sets of coordinately regulated genes and pathways whose expression progressively change over the course of the disease. The progressive nature of these changes enabled us to develop a framework for calculating a disease progression score for individual genes. We show that, in aggregate, these scores correlate strongly with histological measures of disease progression and can thus themselves serve as a proxy for severity. Furthermore, we demonstrate that the expression levels of a small number of genes (~20) can be used to infer disease severity. Finally, we show that patient subgroups can be distinguished by the relative distribution of gene-level scores in specific gene sets. While future work is required to identify the specific disease characteristics that correspond to patient clusters identified on this basis, this work provides a general framework for the use of high-content molecular profiling to identify NAFLD patient subgroups.

PubMed Disclaimer

Conflict of interest statement

Arun Sanyal: Dr. Sanyal is President of Sanyal Biotechnology and has stock options in Genfit, Akarna, Tiziana, Indalo, Durect. He has served as a consultant to Hemoshear, Echosens, AbbVie, Astra Zeneca, Nitto Denko, Ardelyx, Conatus, Nimbus, Amarin, Salix, Tobira, Takeda, Novo Nordisk, Fibrogen, Jannsen, Gilead, Boehringer, Lilly, Zafgen, Novartis, Pfizer, Immuron, Exhalenz and Genfit. He has been an unpaid consultant to Intercept, Immuron, Galectin, Fractyl, Syntlogic, Affimune, Chemomab, Nordic Bioscience and Bristol Myers Squibb. His institution has received grant support from Gilead, Salix, Tobira, Bristol Myers, Shire, Echosens, Intercept, Merck, Astra Zeneca, Malinckrodt, Cumberland and Novartis. He receives royalties from Elsevier and UptoDate. Stephen Hoang, Ryan Feaver, Banumathi Cole, Mark Lawson, Nathan Day, Justin Taylor, and Brian Wamhoff are Employees of HemoShear Therapeutics.

Figures

Figure 1
Figure 1
Integration of differentially expressed genes with a protein-protein interaction network highlights hubs involved in the progression of fatty liver disease. (A,B) A protein-protein interaction network induced by the differentially expressed genes for both NAS and fibrosis stage, respectively. Each node represents a densely connected community of proteins, whose size represents the number of proteins in the community. The node labels provide a summary of the biological processes enriched in each community, as well as a number which is a community identifier. Edge thickness is proportional to the number of connections between communities. (C,D) Box plots showing the distribution of eigenvalue centrality in the communities of each network. Communities significantly enriched with hubs (nodes with relatively large centrality) are labeled with their top 5 genes by centrality.
Figure 2
Figure 2
The top Reactome gene sets that are up- and down-regulated with respect to NAS (A) or fibrosis stage (B). The y-axes represent the GSVA score, which is a pathway-level quantification of gene abundance, and the x-axes represent the clinical assessment. For disease activity (NAS), pathways related to apoptosis, inflammation (Fc epsilon receptor signaling, TNFR2 signaling, T cell receptor (TCR)), cell proliferation (PTEN, TP53) were top pathways whereas for insulin receptor substrate (IRS) signaling pathway was downregulated. For fibrosis, Ephrin signaling related genes were the top pathway while amine derived hormones and nicotinic acetylcholine receptor pathways were down-regulated.
Figure 3
Figure 3
Based on the dynamic range of expression and rank order upon ordinal regression of gene expression levels to the NAFLD activity score (NAS) or fibrosis stage, a gene-level score was derived for all genes tested. The distribution of gNAS scores (A,B) and gFib scores (C,D). Plots (A,C) show the distribution of gNAS or gFib scores for the top 1000 genes in each sample. Plots (B,D) show the relationship between mean gNAS and gFib scores and histological assessments.
Figure 4
Figure 4
Lasso regression of gene expression values against mean gNAS (A,C,E) or gFib scores (B,D,F). Figures (A,B) show the results of 5-fold cross-validation for each model, which have 19 and 18 predictors, respectively. The strong performance of the models in cross-validation demonstrates that disease severity can be assessed from the expression levels of a relatively small number of genes. Figures (C,D) provide the scaled variable importance for model predictors. Figures (E,F) show the standardized regression coefficients for each model.
Figure 5
Figure 5
Patterns of gNAS and gFib scores across patient samples reveal distinct molecular profiles. Panels (A,B), respectively, show standardized gNAS and gFib scores across sets of genes that were identified through gene shaving. Sample clusters in these panels show distinct patterns regulation across these genes, and thus represent patients with distinct molecular profiles. Panels (C,D) show the distributions of mean standardized scores for each sample cluster. Within these plots, patterns across gene clusters (x-axis) represent the average molecular profiles of the sample clusters. Panel (E) shows the intersection of the gNAS- and gFib-based sample clusters and provides the number of samples in each cluster pair. Simultaneous consideration of the two partitions provides additional granularity in sample classification. Panels (F,G) show the most strongly represented Reactome pathways in each gene cluster (by Fisher’s exact test). The pathways represented are closely linked to NAFLD progression.
Figure 6
Figure 6
Patterns of pathway-level regulation with respect to gNAS and gFib scores. The heatmaps (A,B) shows the clustering pattern of samples (columns) and MSigDB hallmark pathways (rows) with respect to mean gene-level scores (values represent column-wise Z-scores). Sample clusters show distinct patterns of pathway-level regulation. Panels (C,D) show the mean sample-wise Z-score in each cluster for the gNAS and gFib analyses, respectively. Higher values in both figures are consistent with relatively advanced disease states. Panel E shows the intersection of the gNAS- and gFib-based sample clusters and provides the number of samples in each cluster pair.

References

    1. Chalasani N, et al. The diagnosis and management of nonalcoholic fatty liver disease: Practice guidance from the American Association for the Study of Liver Diseases. Hepatology. 2018;67:328–357. doi: 10.1002/hep.29367. - DOI - PubMed
    1. Kleiner DE, Brunt EM. Nonalcoholic fatty liver disease: pathologic patterns and biopsy evaluation in clinical research. Semin Liver Dis. 2012;32:3–13. doi: 10.1055/s-0032-1306421. - DOI - PubMed
    1. Kleiner DE, et al. Design and validation of a histological scoring system for nonalcoholic fatty liver disease. Hepatology. 2005;41:1313–1321. doi: 10.1002/hep.20701. - DOI - PubMed
    1. Ekstedt M, et al. Long-term follow-up of patients with NAFLD and elevated liver enzymes. Hepatology. 2006;44:865–873. doi: 10.1002/hep.21327. - DOI - PubMed
    1. Estes C, Razavi H, Loomba R, Younossi Z, Sanyal AJ. Modeling the epidemic of nonalcoholic fatty liver disease demonstrates an exponential increase in burden of disease. Hepatology. 2018;67:123–133. doi: 10.1002/hep.29466. - DOI - PMC - PubMed

Publication types