. 2023 Jan 20;14(2):269.

doi: 10.3390/genes14020269.

GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data

Pauline Schmitt¹, Baptiste Sorin¹, Timothée Frouté¹, Nicolas Parisot¹, Federica Calevro², Sergio Peignier¹

Affiliations

¹ Univ Lyon, INSA-Lyon, INRAE, BF2i, UMR0203, F-69621 Villeurbanne, France.
² Univ Lyon, INRAE, INSA-Lyon, BF2i, UMR0203, F-69621 Villeurbanne, France.

PMID: 36833196
PMCID: PMC9957546
DOI: 10.3390/genes14020269

GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data

Pauline Schmitt et al. Genes (Basel). 2023.

. 2023 Jan 20;14(2):269.

doi: 10.3390/genes14020269.

Authors

Pauline Schmitt¹, Baptiste Sorin¹, Timothée Frouté¹, Nicolas Parisot¹, Federica Calevro², Sergio Peignier¹

Affiliations

¹ Univ Lyon, INSA-Lyon, INRAE, BF2i, UMR0203, F-69621 Villeurbanne, France.
² Univ Lyon, INRAE, INSA-Lyon, BF2i, UMR0203, F-69621 Villeurbanne, France.

PMID: 36833196
PMCID: PMC9957546
DOI: 10.3390/genes14020269

Abstract

Context: Inferring gene regulatory networks (GRN) from high-throughput gene expression data is a challenging task for which different strategies have been developed. Nevertheless, no ever-winning method exists, and each method has its advantages, intrinsic biases, and application domains. Thus, in order to analyze a dataset, users should be able to test different techniques and choose the most appropriate one. This step can be particularly difficult and time consuming, since most methods' implementations are made available independently, possibly in different programming languages. The implementation of an open-source library containing different inference methods within a common framework is expected to be a valuable toolkit for the systems biology community. Results: In this work, we introduce GReNaDIne (Gene Regulatory Network Data-driven Inference), a Python package that implements 18 machine learning data-driven gene regulatory network inference methods. It also includes eight generalist preprocessing techniques, suitable for both RNA-seq and microarray dataset analysis, as well as four normalization techniques dedicated to RNA-seq. In addition, this package implements the possibility to combine the results of different inference tools to form robust and efficient ensembles. This package has been successfully assessed under the DREAM5 challenge benchmark dataset. The open-source GReNaDIne Python package is made freely available in a dedicated GitLab repository, as well as in the official third-party software repository PyPI Python Package Index. The latest documentation on the GReNaDIne library is also available at Read the Docs, an open-source software documentation hosting platform. Contribution: The GReNaDIne tool represents a technological contribution to the field of systems biology. This package can be used to infer gene regulatory networks from high-throughput gene expression data using different algorithms within the same framework. In order to analyze their datasets, users can apply a battery of preprocessing and postprocessing tools and choose the most adapted inference method from the GReNaDIne library and even combine the output of different methods to obtain more robust results. The results format provided by GReNaDIne is compatible with well-known complementary refinement tools such as PYSCENIC.

Keywords: Python; bioinformatics; ensemble learning; gene expression; gene regulatory network inference; machine learning; systems biology.

PubMed Disclaimer

Conflict of interest statement

All authors declare no competing interests, either financial or nonfinancial.

Figures

**Figure 1**
The GReNaDIne GRN Inference workflow is organized in three modules: (a) Gene expression preprocessing, including RNA-seq normalization, standardization, and discretization techniques. (b) GRN data-driven inference scoring methods, including techniques based on MI and correlation scores, methods based on regression algorithms, and techniques based on classification algorithms. This second module also incorporates some integration schemes to combine results from different methods to form ensembles. (c) Postprocessing regulatory edges selection tools and GRN evaluation methods. The GRN inference workflow of GreNaDIne simply requires as inputs a gene expression matrix and facultatively a list of regulatory genes (e.g., TFs).

**Figure 2**
Cluster maps representing the gain in (a) AUROC and (b) AUPR values for each inference methods (rows) without using any preprocessing techniques on each benchmark dataset (columns), with respect to the AUROC or AUPR reference score of the DREAM 5 community approach. The family of each method is reported in colors in the left column: regression in green, classification in red, and correlation/MI in blue. The GReNaDIne inference methods exhibited comparable and even better results than those obtained by the DREAM5 community approach. The inner biases and advantages of each method make it suitable for some particular datasets; indeed, the methods performed differently on each particular dataset, and no ever-winning method existed.

**Figure 3**
Cluster maps representing the (a) AUROC and (b) AUPR values for each combination of inference methods (rows) and preprocessing technique (columns); notice that column I represents the identity (i.e., no preprocessing technique applied). The family of each method is reported in colors in the left column: regression in green, classification in red, and correlation/MI in blue. Preprocessing techniques that ensure that genes exhibit comparable levels of expression (i.e., row z-score, EFD, and row K-means) led to better performances on average.

**Figure 4**
(a) AUROC and (b) AUPR scores obtained by ensembles of the inference methods (i.e., BRS•SVM•Ens and BRS•SVM•Ens•Corr), single methods, and DREAM5 community. The ensembles of GReNaDIne that contained BRSr, an SVM-based method, as well as a method based on ensembles of trees or linear regressors (ensemble termed BRSr•SVM•Ens), and also including an extra correlation- or MI-based method (ensemble termed BRSr•SVM•Ens•Corr), revealed to be efficient and robust across different datasets, outperforming single methods as well as the robust DREAM5 community method, with respect to both the AUROC and AUPR scores.

**Figure 5**
Boxplots representing the average (a) AUROC and (b) AUPR obtained by the SVM•BRS•Ens ensembles, with different integration schemes, on all DREAM5 datasets. The integration schemes are arranged in ascending order based on their average AUROC and AUPR scores. All integration schemes had suitable results, but rank-TF and Z-score-TF tended to exhibited lower results compared to Z-score-TG, rank-full, Z-score-full, and Rank-TG. Therefore, it is suggested to use these latter integration schemes.

See this image and copyright information in PMC

Cited by

Aggrephagy-related patterns in tumor microenvironment, prognosis, and immunotherapy for acute myeloid leukemia: a comprehensive single-cell RNA sequencing analysis.
Pan Y, Wang Y, Hu M, Xu S, Jiang F, Han Y, Chen F, Liu Z. Pan Y, et al. Front Oncol. 2023 Jul 17;13:1195392. doi: 10.3389/fonc.2023.1195392. eCollection 2023. Front Oncol. 2023. PMID: 37534253 Free PMC article.
Gene Self-Expressive Networks as a Generalization-Aware Tool to Model Gene Regulatory Networks.
Peignier S, Calevro F. Peignier S, et al. Biomolecules. 2023 Mar 13;13(3):526. doi: 10.3390/biom13030526. Biomolecules. 2023. PMID: 36979461 Free PMC article.
Disruption of Morrbid alleviates autoinflammatory osteomyelitis in Pstpip2-deficient mice.
Huo Q, Ding J, Zhou H, Wang Y, Wang S, He H, Cai LC, Liu J, Dong G, Cai Z. Huo Q, et al. Dis Model Mech. 2025 Jul 1;18(7):dmm052176. doi: 10.1242/dmm.052176. Epub 2025 Jul 7. Dis Model Mech. 2025. PMID: 40503910 Free PMC article.
A distinct immune landscape in anti-synthetase syndrome profiled by a single-cell genomic study.
Ding J, Li Y, Wang Z, Han F, Chen M, Du J, Yang T, Zhang M, Wang Y, Xu J, Wang G, Xu Y, Wu X, Hao J, Liu X, Zhang G, Zhang N, Sun W, Cai Z, Wei W. Ding J, et al. Front Immunol. 2024 Oct 24;15:1436114. doi: 10.3389/fimmu.2024.1436114. eCollection 2024. Front Immunol. 2024. PMID: 39512337 Free PMC article.
Enhance the therapeutic efficacy of human umbilical cord-derived mesenchymal stem cells in prevention of acute graft-versus-host disease through CRISPLD2 modulation.
Xu Q, Wang R, Sui K, Xu Y, Zhou Y, He Y, Hu Z, Wang Q, Xie X, Wang X, Yang S, Zeng L, Zhong JF, Wang Z, Song Q, Zhang X. Xu Q, et al. Stem Cell Res Ther. 2025 May 1;16(1):222. doi: 10.1186/s13287-025-04321-6. Stem Cell Res Ther. 2025. PMID: 40312744 Free PMC article.

See all "Cited by" articles

References

1. Levine M., Davidson E.H. Gene regulatory networks for development. Proc. Natl. Acad. Sci. USA. 2005;102:4936–4949. doi: 10.1073/pnas.0408031102. - DOI - PMC - PubMed
1. Shis D.L., Bennett M.R., Igoshin O.A. Dynamics of bacterial gene regulatory networks. Ann. Rev. Biophys. 2018;47:447–467. doi: 10.1146/annurev-biophys-070317-032947. - DOI - PubMed
1. Chen Y.-C., Desplan C. Gene regulatory networks during the development of the Drosophila visual system. Curr. Top. Dev. Biol. 2020;139:89–125. - PMC - PubMed
1. Shahbazi M.N. Mechanisms of human embryo development: From cell fate to tissue shape and back. Development. 2020:147. doi: 10.1242/dev.190629. - DOI - PMC - PubMed
1. Aibar S., González-Blas C.B., Moerman T., Huynh-Thu V.A., Imrichova H., Hulselmans G., Rambow F., Marine J.C., Geurts P., Aerts J., et al. Scenic: Single-cell regulatory network inference and clustering. Nat. Methods. 2017;14:1083. doi: 10.1038/nmeth.4463. - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data

Affiliations

GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Miscellaneous

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources

Miscellaneous