Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb:90:107425.
doi: 10.1016/j.compbiolchem.2020.107425. Epub 2020 Dec 8.

Prediction of an outcome using NETwork Clusters (NET-C)

Affiliations

Prediction of an outcome using NETwork Clusters (NET-C)

Jai Woo Lee et al. Comput Biol Chem. 2021 Feb.

Abstract

Birth weight is a key consequence of environmental exposures and metabolic alterations and can influence lifelong health. While a number of methods have been used to examine associations of trace element (including essential nutrients and toxic metals) concentrations or metabolite concentrations with a health outcome, birth weight, studies evaluating how the coexistence of these factors impacts birth weight are extremely limited. Here, we present a novel algorithm NETwork Clusters (NET-C), to improve the prediction of outcome by considering the interactions of features in the network and then apply this method to predict birth weight by jointly modelling trace element and cord blood metabolite data. Specifically, by using trace element and/or metabolite subnetworks as groups, we apply group lasso to estimate birth weight. We conducted statistical simulation studies to examine how both sample size and correlations between grouped features and the outcome affect prediction performance. We showed that in terms of prediction error, our proposed method outperformed other methods such as (a) group lasso with groups defined by hierarchical clustering, (b) random forest regression and (c) neural networks. We applied our method to data ascertained as part of the New Hampshire Birth Cohort Study on trace elements, metabolites and birth outcomes, adjusting for other covariates such as maternal body mass index (BMI) and enrollment age. Our proposed method can be applied to a variety of similarly structured high-dimensional datasets to predict health outcomes.

Keywords: Dimensionality reduction; Gaussian graphical model; Lasso; Metabolic network; Outcome prediction; Trace element exposures.

PubMed Disclaimer

Conflict of interest statement

Disclosures and Ethics

As a requirement of publication author(s) have provided to the publisher signed confirma-tion of compliance with legal and ethical obligations including but not limited to the fol-lowing: authorship and contributorship, conflicts of interest, privacy and confidentiality and (where applicable) protection of human and animal research subjects. The authors have read and confirmed their agreement with the ICMJE authorship and conflict of inter-est criteria. The authors have also confirmed that this article is unique and not under con-sideration or published in any other publication, and that they have permission from rights holders to reproduce any copyrighted material. Any disclosures are made in this section.

Figures

Figure 1:
Figure 1:
The matrix representation of three large clusters, three medium clusters and three small clusters. The strength of feature interactions in each square submatrix, C[i,j], in the matrix is randomly chosen between 0.1 and 0.7 or between −0.7 and −0.1, and the strength of other feature interactions outside submatrices of clusters is chosen between 0.01 and 0.012 or between −0.012 and −0.01. C[i,j] with integers i, jS1 × T1 where S1 and T1 are integers from 1 to 9, i, jS2 × T2 where S2 and T2 are integers from 10 to 18, and i, jS3 × T3 where S3 and T3 are integers from 19 to 27 represents 3 large clusters, C[i,j] with integers i, jS4 × T4 where S4 and T4 are integers from 28 to 33, i, jS5 × T5 where S5 and T5 are integers from 34 to 39, and i, jS6 × T6 where S6 and T6 are integers from 40 to 45 represents 3 medium clusters and C[i,j] with integers i, jS7 × T7 where S7 and T7 are integers from 46 to 48, i, jS8 × T8 where S8 and T8 are integers from 49 to 51, and i, jS9 × T9 where S9 and T9 are integers from 52 to 54 represents 3 small clusters
Figure 2:
Figure 2:
Metabolites in this subnetwork including lysophosphatidylcholines were nonnegatively associated with birth weight. The metabolite nodes such as lysophosphadidylcholines colored green were positively associated with birth weight

Similar articles

References

    1. Ramsay Ian S., Ma Sisi, Fisher Melissa, Loewy Rachel L., Ragland 415 J. Daniel, Niendam Tara, Carter Cameron S., Vinogradoy Sophia, Model selection and prediction of outcomes in recent onset schizophrenia patients who undergo cognitive training, Schizophrenia Research: Cognition Volume 11, March 2018, Pages 1–5, 10.1016/j.scog.2017.10.001 - DOI - PMC - PubMed
    1. Bremner I. (1988) Trace Elements in Man and Animals 6, 1988, ISBN : 978-1-4612-8050-7
    1. Piazza I, Kochanowski K, Cappelletti V, Fuhrer T, Noor E, Sauer U, Picotti P. A Map of Protein-Metabolite Interactions Reveals Principles of Chemical Communication. Cell. 2018. January 11;172(1–2):358–372.e23. doi: 10.1016/j.cell.2017.12.006. - DOI - PubMed
    1. Havens TC, Bezdek JC and Palaniswami M, “Scalable single linkage hierarchical clustering for big data,” 2013 IEEE Eighth International Conference on Intelligent Sensors, Sensor Networks and Information Processing, Melbourne, VIC, 2013, pp. 396–401, doi: 10.1109/ISSNIP.2013.6529823. - DOI
    1. Miyagi A, Takahashi H, Takahara K et al. Principal component and hierarchical clustering analysis of metabolites in destructive weeds; polygonaceous plants. Metabolomics 6, 146–155 (2010). 10.1007/s11306-009-0186-y - DOI

LinkOut - more resources