Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Jan 9;9(1):e84955.
doi: 10.1371/journal.pone.0084955. eCollection 2014.

Clustering gene expression regulators: new approach to disease subtyping

Affiliations

Clustering gene expression regulators: new approach to disease subtyping

Mikhail Pyatnitskiy et al. PLoS One. .

Abstract

One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA) which identifies gene subnetworks with significant concordant changes in expression between two conditions. Subnetwork consists of central regulator and downstream genes connected by relations extracted from global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores which are used for final patients grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with complementary level of understanding of pathway-level biology behind a disease by identification of significant expression regulators. We have observed the reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms), that was not revealed using standard expression profile clustering. For another experiment we were able to suggest the clusters of regulators, responsible for colorectal carcinoma vs adenoma discrimination and identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. Proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. Obtained clusters of regulators make possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for individual patient.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: Mikhail Pyatnitskiy, Ilya Mazo, Elena Schwartz and Ekaterina Kotelnikova are employed by Ariadne Diagnostics LLC, 9430 Key West Avenue, Suite 115 Rockville, Maryland 20850, USA. Maria Shkrob is employed by Elsevier Inc, 9430 Key West Avenue, Suite 113, Rockville, Maryland, 20850, USA. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials.

Figures

Figure 1
Figure 1. Overall pipeline of the proposed approach for disease subtyping.
See corresponding section for detailed description.
Figure 2
Figure 2. Heatmap of activity scores (k-values) for clusters of regulators identified in GSE4183 dataset.
Samples are in columns, clusters of regulators are in rows. Horizontal side bar color encodes true class labels.
Figure 3
Figure 3. Comparison of clustering of 12 diseases of human muscle.
A) Dendrogram obtained using proposed approach based on analysis of regulators activity. B) Dendrogram obtained using Ward's method for clustering gene expression data.

Similar articles

Cited by

References

    1. Haury AC, Gestraud P, Vert JP (2011) The influence of feature selection methods on accuracy, stability and interpretability of molecular signatures. PLoS One 6: e28210. - PMC - PubMed
    1. Michiels S, Koscielny S, Hill C (2005) Prediction of cancer outcome with microarrays: a multiple random validation strategy. Lancet 365: 488–492. - PubMed
    1. Ein-Dor L, Kela I, Getz G, Givol D, Domany E (2005) Outcome signature genes in breast cancer: is there a unique set? Bioinformatics 21: 171–178. - PubMed
    1. Hummel M, Metzeler KH, Buske C, Bohlander SK, Mansmann U (2008) Association between a prognostic gene signature and functional gene sets. Bioinform Biol Insights 2: 329–341. - PMC - PubMed
    1. Venet D, Dumont JE, Detours V (2011) Most random gene expression signatures are significantly associated with breast cancer outcome. PLoS Comput Biol 7: e1002240. - PMC - PubMed

Publication types

MeSH terms