Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 28:7:e363.
doi: 10.7717/peerj-cs.363. eCollection 2021.

MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks

Affiliations

MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks

Nisar Wani et al. PeerJ Comput Sci. .

Abstract

High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets.

Keywords: GRN inference; Gene regulatory networks; Network biology; Systems biology; large-scale GRN.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Figure 1
Figure 1. Application architecture of MKL-GRNI (A) Combined kernel (B) Decomposed regulation matrices (C) Parallel distribution and model building (D) Model execution (E) Writing results to shared object.
Figure 2
Figure 2. Genomic data fusion by combining kernel matrices from multiple kernels into a single combined kernel.
Figure 3
Figure 3. Performance metrics for parallel MKL-GRNI algorithm: (A) Speedup, (B) Efficiency, (C) Redundancy, (D) Quality.

References

    1. Albert R. Network inference, analysis, and modeling in systems biology. Plant Cell. 2007;19(11):3327–3338. doi: 10.1105/tpc.107.054700. - DOI - PMC - PubMed
    1. Alioscha-Perez M, Oveneke MC, Sahli H. Svrg-mkl: a fast and scalable multiple kernel learning solution for features combination in multi-class classification problems. IEEE Transactions on Neural Networks and Learning Systems. 2019;31(5):1710–1723. doi: 10.1109/TNNLS.2019.2922123. - DOI - PubMed
    1. Ben-Hur A, Noble WS. Kernel methods for predicting protein-protein interactions. Bioinformatics. 2005;21(Suppl. 1):i38–i46. doi: 10.1093/bioinformatics/bti1016. - DOI - PubMed
    1. Butte AJ, Kohane IS. Biocomputing 2000. Singapore: World Scientific; 1999. Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements; pp. 418–429. - PubMed
    1. Chen Z-Y, Fan Z-P. Parallel multiple kernel learning: a hybrid alternating direction method of multipliers. Knowledge and Information Systems. 2014;40(3):673–696. doi: 10.1007/s10115-013-0655-5. - DOI

LinkOut - more resources