MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
- PMID: 33817013
- PMCID: PMC7924726
- DOI: 10.7717/peerj-cs.363
MKL-GRNI: A parallel multiple kernel learning approach for supervised inference of large-scale gene regulatory networks
Abstract
High throughput multi-omics data generation coupled with heterogeneous genomic data fusion are defining new ways to build computational inference models. These models are scalable and can support very large genome sizes with the added advantage of exploiting additional biological knowledge from the integration framework. However, the limitation with such an arrangement is the huge computational cost involved when learning from very large datasets in a sequential execution environment. To overcome this issue, we present a multiple kernel learning (MKL) based gene regulatory network (GRN) inference approach wherein multiple heterogeneous datasets are fused using MKL paradigm. We formulate the GRN learning problem as a supervised classification problem, whereby genes regulated by a specific transcription factor are separated from other non-regulated genes. A parallel execution architecture is devised to learn a large scale GRN by decomposing the initial classification problem into a number of subproblems that run as multiple processes on a multi-processor machine. We evaluate the approach in terms of increased speedup and inference potential using genomic data from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The results thus obtained demonstrate that the proposed method exhibits better classification accuracy and enhanced speedup compared to other state-of-the-art methods while learning large scale GRNs from multiple and heterogeneous datasets.
Keywords: GRN inference; Gene regulatory networks; Network biology; Systems biology; large-scale GRN.
© 2021 Wani and Raza.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
References
-
- Butte AJ, Kohane IS. Biocomputing 2000. Singapore: World Scientific; 1999. Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements; pp. 418–429. - PubMed
-
- Chen Z-Y, Fan Z-P. Parallel multiple kernel learning: a hybrid alternating direction method of multipliers. Knowledge and Information Systems. 2014;40(3):673–696. doi: 10.1007/s10115-013-0655-5. - DOI
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous