Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Oct 30;7(1):14339.
doi: 10.1038/s41598-017-14682-5.

A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks

Affiliations

A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks

Xin Huang et al. Sci Rep. .

Abstract

Analyzing omics data from a network-based perspective can facilitate biomarker discovery. To improve disease diagnosis and identify prospective information indicating the onset of complex disease, a computational method for identifying potential biomarkers based on differential sub-networks (PB-DSN) is developed. In PB-DSN, Pearson correlation coefficient (PCC) is used to measure the relationship between feature ratios and to infer potential networks. A differential sub-network is extracted to identify crucial information for discriminating different groups and indicating the emergence of complex diseases. Subsequently, PB-DSN defines potential biomarkers based on the topological analysis of these differential sub-networks. In this study, PB-DSN is applied to handle a static genomics dataset of small, round blue cell tumors and a time-series metabolomics dataset of hepatocellular carcinoma. PB-DSN is compared with support vector machine-recursive feature elimination, multivariate empirical Bayes statistics, analyzing time-series data based on dynamic networks, molecular networks based on PCC, PinnacleZ, graph-based iterative group analysis, KeyPathwayMiner and BioNet. The better performance of PB-DSN not only demonstrates its effectiveness for the identification of discriminative features that facilitate disease classification, but also shows its potential for the identification of warning signals.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1
Star sub-networks based on ratio 1. (ad) The star sub-networks consisting of the edges linked with ratio 1 in SG EWS, SG EWS-RMS, SG EWS-BL and SG EWS-NB, respectively. Ratios in the four sub-networks are same. The numbers of the connections with ratio 1 in SG EWS, SG EWS-RMS, SG EWS-BL and SG EWS-NB are 370, 10, 32 and 50.
Figure 2
Figure 2
Statistical analysis of the top 5 ratios. (ae) The statistical analysis (the mean ± the S.E.) of ratio i (1 ≤ i ≤ 5), respectively.
Figure 3
Figure 3
Star sub-networks based on N,N-dimethylglycine/threonic acid and box plot. (ag) The star sub-networks consisting of the edges linked with N,N-dimethylglycine/threonic acid during disease development. Ratios in the seven sub-networks are same. The numbers of the connections with N,N-dimethylglycine/threonic acid in SG i (1 ≤ i ≤ 7) are 4, 5, 4, 18, 36, 4 and 4. (h) The box plot of N,N-dimethylglycine/threonic acid.
Figure 4
Figure 4
Statistical analysis of the 3 metabolite ratios. (ac) The metabolic trajectories (the mean ± the S.E.) of N,N-dimethylglycine/threonic acid, N,N-dimethylglycine/mucic acid and betaine/mucic acid in the training set. (d,e) The ROC curves of these 3 metabolite ratios in the training and test sets.

References

    1. Atkinson AJ, et al. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin. Pharmacol. Ther. 2001;69:89–95. doi: 10.1067/mcp.2001.113989. - DOI - PubMed
    1. Liu R, Wang X, Aihara K, Chen L. Early diagnosis of complex diseases by molecular biomarkers, network biomarkers, and dynamical network biomarkers. Med. Res. Rev. 2014;34:455–478. doi: 10.1002/med.21293. - DOI - PubMed
    1. Saccenti E, Hoefsloot HCJ, Smilde AK, Westerhuis JA, Hendriks MMWB. Reflections on univariate and multivariate analysis of metabolomics data. Metabolomics. 2013;10:361–374. doi: 10.1007/s11306-013-0598-6. - DOI
    1. Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. MLear. 2002;46:389–422.
    1. Goldberg DE, Holland JH. Genetic algorithms and machine learning. MLear. 1988;3:95–99.

Publication types

LinkOut - more resources