Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Nov 5:18:3335-3343.
doi: 10.1016/j.csbj.2020.10.022. eCollection 2020.

Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks

Affiliations

Inductive inference of gene regulatory network using supervised and semi-supervised graph neural networks

Juexin Wang et al. Comput Struct Biotechnol J. .

Abstract

Discovering gene regulatory relationships and reconstructing gene regulatory networks (GRN) based on gene expression data is a classical, long-standing computational challenge in bioinformatics. Computationally inferring a possible regulatory relationship between two genes can be formulated as a link prediction problem between two nodes in a graph. Graph neural network (GNN) provides an opportunity to construct GRN by integrating topological neighbor propagation through the whole gene network. We propose an end-to-end gene regulatory graph neural network (GRGNN) approach to reconstruct GRNs from scratch utilizing the gene expression data, in both a supervised and a semi-supervised framework. To get better inductive generalization capability, GRN inference is formulated as a graph classification problem, to distinguish whether a subgraph centered at two nodes contains the link between the two nodes. A linked pair between a transcription factor (TF) and a target gene, and their neighbors are labeled as a positive subgraph, while an unlinked TF and target gene pair and their neighbors are labeled as a negative subgraph. A GNN model is constructed with node features from both explicit gene expression and graph embedding. We demonstrate a noisy starting graph structure built from partial information, such as Pearson's correlation coefficient and mutual information can help guide the GRN inference through an appropriate ensemble technique. Furthermore, a semi-supervised scheme is implemented to increase the quality of the classifier. When compared with established methods, GRGNN achieved state-of-the-art performance on the DREAM5 GRN inference benchmarks. GRGNN is publicly available at https://github.com/juexinwang/GRGNN.

Keywords: Gene regulatory; Graph neural networks; Inductive learning; Machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

None
Graphical abstract
Fig. 1
Fig. 1
GRGNN scheme. Noisy starting skeletons derived from Pearson’s correlation and mutual information are used to generate the enclosed positive subgraph centering with A and B, and the negative graph centering with C and D. Graph neural networks as the agents are learned independently. An ensemble classifier is built upon these agents and used for the link prediction through graph classification.
Fig. 2
Fig. 2
ROC curve and Precision-Recall curve on balanced training and testing.
Fig. 3
Fig. 3
Performances of GRGNN in different numbers of hops.

Similar articles

Cited by

References

    1. Aliferis C.F., Statnikov A., Tsamardinos I., Mani S., Koutsoukos X.D. Local causal and markov blanket induction for causal discovery and feature selection for classification Part i: algorithms and empirical evaluation. J Mach Learn Res. 2010;11:171–234.
    1. Alon U. Network motifs: theory and experimental approaches. Nat Rev Genet. 2007;8(6):450–461. doi: 10.1038/nrg2102. - DOI - PubMed
    1. Bassett G.W., Tam M.-Y.-S., Knight K. Quantile models and estimators for data analysis. In: Dutter R., Filzmoser P., Gather U., Rousseeuw P.J., editors. Developments in Robust Statistics. Physica-Verlag HD; Heidelberg: 2003. pp. 77–87. - DOI
    1. Bleakley Kevin, Biau Gérard, Vert Jean-Philippe. Supervised reconstruction of biological networks with local models. Bioinformatics. 2007;23(13):i57–i65. doi: 10.1093/bioinformatics/btm204. - DOI - PubMed
    1. Cerulo L., Elkan C., Ceccarelli M. Learning gene regulatory networks from only positive and unlabeled data. BMC Bioinf. 2010;11(1):228. doi: 10.1186/1471-2105-11-228. - DOI - PMC - PubMed

LinkOut - more resources